ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-05-31 19:16:41 +00:00

Author	SHA1	Message	Date
Niels de Vos	6d00b39886	cleanup: move log functions to new internal/util/log package Moving the log functions into its own internal/util/log package makes it possible to split out the humongous internal/util packages in further smaller pieces. This reduces the inter-dependencies between utility functions and components, preventing circular dependencies which are not allowed in Go. Updates: #852 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-26 09:34:05 +00:00
Prasanna Kumar Kalever	b6a88dd728	rbd: add volume healer Problem: ------- For rbd nbd userspace mounter backends, after a restart of the nodeplugin all the mounts will start seeing IO errors. This is because, for rbd-nbd backends there will be a userspace mount daemon running per volume, post restart of the nodeplugin pod, there is no way to restore the daemons back to life. Solution: -------- The volume healer is a one-time activity that is triggered at the startup time of the rbd nodeplugin. It navigates through the list of volume attachments on the node and acts accordingly. For now, it is limited to nbd type storage only, but it is flexible and can be extended in the future for other backend types as needed. From a few feets above: This solves a severe problem for nbd backed csi volumes. The healer while going through the list of volume attachments on the node, if finds the volume is in attached state and is of type nbd, then it will attempt to fix the rbd-nbd volumes by sending a NodeStageVolume request with the required volume attributes like secrets, device name, image attributes, and etc.. which will finally help start the required rbd-nbd daemons in the nodeplugin csi-rbdplugin container. This will allow reattaching the backend images with the right nbd device, thus allowing the applications to perform IO without any interruptions even after a nodeplugin restart. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	874f6629fb	rbd: get default plugin path Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Yati Padia	696ee496fc	cleanup: resolves gofumpt for cmd This commit resolves gofumpt linter for cmd folder Updates: 1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-14 17:19:00 +00:00
Yati Padia	4a649fe17f	cleanup: resolve godot linter This commit resolves godot linter issue which says "Comment should end in a period (godot)". Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-13 06:50:03 +00:00
Humble Chirammal	7409714103	cephfs: reformat long lines in 'cmd' package to 120 chars We have many declarations and invocations..etc with long lines which are very difficult to follow while doing code reading. This address the issues in 'cmd' package to restrict the line length to 120 chars. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-06-28 14:43:49 +00:00
Yati Padia	774e8e4042	util: enable golang profiling Add support for golang profiling. Standard tools like go tool pprof and curl work. example: $ go tool pprof http://localhost:8080/debug/pprof/profile $ go tool pprof http://localhost:8080/debug/pprof/heap $ curl http://localhost:8080/debug/pprof/heap?debug=1 https://golang.org/pkg/net/http/pprof/ contains more details about the pprof interface. Fixes: #1699 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-05-25 10:41:22 +00:00
Rakshith R	7d105143fc	cleanup: refactor deeply nested if statement in cephcsi.go Refactored deeply nested if statement in cephcsi.go to reduce cognitive complexity. Signed-off-by: Rakshith R <rar@redhat.com>	2021-04-07 02:31:41 +00:00
Humble Chirammal	82bc993b32	util: remove unwanted import string from module dependencies There is no need for an extra import string when the go mod package itself declared in the same. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-03-24 15:12:13 +00:00
Madhu Rajanna	30af703a2f	rbd: add controller to main initialize and start the rbd controller when we the driver type is controller. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Madhu Rajanna	8d3a44d0c4	rbd: add minsnapshotsonimage flag An rbd image can have a maximum number of snapshots defined by maxsnapshotsonimage On the limit is reached the cephcsi will start flattening the older snapshots and returns the ABORT error message, The Request comes after this as to wait till all the images are flattened (this will increase the PVC creation time. Instead of waiting till the maximum snapshots on an RBD image, we can have a soft limit, once the limit reached cephcsi will start flattening the task to break the chain. With this PVC creation time will only be affected when the hard limit (minsnapshotsonimage) reached. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-18 05:59:20 +00:00
Niels de Vos	ecc33a9f86	cleanup: no need to validate conf.Vtype twice conf.Vtype was verified already, no need to do it a 2nd time. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Niels de Vos	4c91f07c78	cleanup: do not panic when validateMaxSnaphostFlag() detects an error When the cephcsi executable detects an error when calling validateMaxSnaphostFlag(), it panics due to klog.Fatalln(). The error that validateMaxSnaphostFlag() logs should be understandable enough, so that users know what to investigate. A Go panic on a user error is not very userfriendly, and does not provide any additional usefil information. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Niels de Vos	3e305970df	cleanup: do not panic when validateCloneDepthFlag() detects an error When the cephcsi executable receives an error when calling validateCloneDepthFlag(), it panics due to klog.Fatalln(). The errors that validateCloneDepthFlag() logs should be understandable enough, so that users know what to investigate. A Go panic on a user error is not very userfriendly, and does not provide any additional usefil information. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Niels de Vos	86a8d29bd1	cleanup: do not panic when the metricspath is not a valid URL When the cephcsi executable receives an error when calling util.ValidateURL() on the optional "metricspath". The error that util.ValidateURL() returns should be understandable enough, so that users know what to investigate. A Go panic on a user error is not very userfriendly, and does not provide any additional usefil information. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Niels de Vos	23817c1a83	cleanup: do not panic on invalid drivername When the cephcsi executable receives an error when calling util.ValidateDriverName(), it panics due to klog.Fatalln(). The error that util.ValidateDriverName() returns should be understandable enough, so that users know what to investigate. A Go panic on a user error is not very userfriendly, and does not provide any additional usefil information. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Niels de Vos	79e91fa894	cleanup: prevent Go panic on missing driver type When running the 'cephcsi' executable without arguments, a Go panic is reported: $ ./_output/cephcsi F1026 13:59:04.302740 3409054 cephcsi.go:126] driver type not specified goroutine 1 [running]: k8s.io/klog/v2.stacks(0xc000010001, 0xc0000520a0, 0x48, 0x9a) /go/src/github.com/ceph/ceph-csi/vendor/k8s.io/klog/v2/klog.go:996 +0xb9 k8s.io/klog/v2.(loggingT).output(0x2370360, 0xc000000003, 0x0, 0x0, 0xc000194770, 0x20cb265, 0xa, 0x7e, 0x413500) /go/src/github.com/ceph/ceph-csi/vendor/k8s.io/klog/v2/klog.go:945 +0x191 k8s.io/klog/v2.(loggingT).println(0x2370360, 0x3, 0x0, 0x0, 0xc000163e08, 0x1, 0x1) /go/src/github.com/ceph/ceph-csi/vendor/k8s.io/klog/v2/klog.go:699 +0x11a k8s.io/klog/v2.Fatalln(...) /go/src/github.com/ceph/ceph-csi/vendor/k8s.io/klog/v2/klog.go:1456 main.main() /go/src/github.com/ceph/ceph-csi/cmd/cephcsi.go:126 +0xafa Just logging the error and exiting should be sufficient. This stack-trace from the Go panic does not add any useful information. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Niels de Vos	7101a6dc8e	cleanup: add logAndExit() for cephcsi:main() to call instead of panic The main() function of the cephcsi executable calls klog.Fatalln() to report certain errors. This causes the executable to panic which is not helpful to users that only need the error message. By introducing logAndExit(), there is no need to call klog.Fatalln() anymore. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-28 09:37:36 +00:00
Yug	71ddf51544	cleanup: address gomnd warnings Direct usage of numbers should be avoided. Issue reported: mnd: Magic number: X, in <argument> detected (gomnd) Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Madhu Rajanna	d15ded88f5	cleanup: Remove support for Delete and Unmounting v1.1.0 PVC as v1.0.0 is deprecated we need to remove the support for it in the Next coming (v3.0.0) release. This PR removes the support for the same. closes #882 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-10 16:07:13 +00:00
Yug	1490daed7e	cleanup: Avoid usage of numbers Add seperate functions to handle all levels and types of logging. Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-10 07:41:23 +00:00
Yug	8dc4ab6b1b	rebase: update k8s.io/klog to v2.3.0 Update klog version to v2.3.0 Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-10 07:41:23 +00:00
Madhu Rajanna	8ef7143e6c	rbd: add maxsnapshotsonimage flag Added maxsnapshotsonimage flag to flatten the older rbd images on the chain to avoid issue in krbd.The limit is in krbd since it only allocate 1 4KiB page to handle all the snapshot ids for an image. The max limit is 510 as per https://github.com/torvalds/linux/blob/ aaa2faab4ed8e5fe0111e04d6e168c028fe2987f/drivers/block/rbd.c#L98 in cephcsi we arekeeping the default to 450 to reserve 10% to avoid issues. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-06 10:08:31 +00:00
Madhu Rajanna	b085577a4f	rbd: add skipForceFlatten flag added skipForceFlatten flag to skip the image deptha and skip image flattening. This will be very useful if the kernel is not listed in cephcsi which supports deep flatten fauture. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-01 08:21:47 +00:00
Madhu Rajanna	9b518726ab	rbd: add hardlimt and softlimit flag added Hardlimit and Softlimit flags for cephcsi arguments. When the Softlimit is reached cephcsi will start a background task to flatten the rbd image and return success and if the hardlimit is reached it will start a background task to flatten the rbd image and return ready to use as false to make sure that the image will not be used until it is flatten. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-01 08:21:47 +00:00
Madhu Rajanna	567b2ab280	util: Rename KernelVersion to GetKernelVersion Renames KernelVersion to GetKernelVersion for more readibility. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-01 08:21:47 +00:00
Niels de Vos	1b0b26a7a6	util: log kernel version on start It is useful to have the kernel version logged while starting binaries. Some functionality depends on the version of the kernel, debugging issues related to this will be easier. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-06-18 11:42:47 +00:00
Niels de Vos	32839948ef	cleanup: move pkg/ to internal/ The internal/ directory in Go has a special meaning, and indicates that those packages are not meant for external consumption. Ceph-CSI does provide public APIs for other projects to consume. There is no plan to keep the API of the internally used packages stable. Closes: #903 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-04-23 11:00:59 +00:00
Madhu Rajanna	5b14cc9272	Fix goimports issue in CI Fix below error in current codebase File is not `goimports`-ed with -local github.com/ceph/ceph-csi Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-15 09:18:47 +00:00
ShyamsundarR	5c4abf8347	Add topology support to ceph-csi Signed-off-by: ShyamsundarR <srangana@redhat.com>	2020-04-14 14:14:29 +00:00
Madhu Rajanna	bcd646ee55	Deprecate grpc metrics in ceph-csi As kubernetes CSI sidecar is exposing the GRPC mertics we can make use of the same in ceph-csi we dont need to expose our own. update: #881 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-01 11:59:37 +00:00
xu.chen	399f0b0d89	Audit log and follow klog standard	2020-03-27 09:24:52 +00:00
Madhu Rajanna	034b123478	Remove mount cache for cephfs PR #282 introduces the mount cache to solve cephfs fuse mount issue when cephfs plugin pod restarts .This is not working as intended. This PR removes the code for maintainability. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-02-11 15:11:21 +00:00
Woohyung Han	8a16f740d6	Update golangci-lint version to v1.21.0 Signed-off-by: Woohyung Han <techhanx@gmail.com>	2019-12-12 04:57:14 +00:00
Madhu Rajanna	118f34525e	Remove deprecated containerized As we are moving towards v2.0.0 I think it's a good time to remove the deprecated flag. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-12-11 14:44:35 +00:00
Madhu Rajanna	dfc3562e29	Add Version flag to cephcsi This will be helpful if someone wants to check the cephcsi version output ``` docker run quay.io/cephcsi/cephcsi:v1.2.1 --version Cephcsi Version: v1.2.1 Git Commit: 4b871366327d63e27fc1abfb699f0faaf0fc16b9 GoVersion: go1.12.5 Compiler: gc Platform: linux/amd64 ``` Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-11-06 12:35:28 +00:00
Stefan Haas	6a2717ce20	Added forcecephkernelclient as startup parameter to force enabling ceph Signed-off-by: Stefan Haas <shaas@suse.com>	2019-10-16 06:47:10 +00:00
Madhu Rajanna	7002da1a6a	deprecate containerized flag and remove nsenter Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-09-23 13:22:29 +00:00
Madhu Rajanna	f4b38228ae	Remove volumemounter flag from cephfs Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-09-05 07:20:50 +00:00
Madhu Rajanna	a81a3bf96b	implement grpc metrics for ceph-csi Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-30 06:50:32 +00:00
Madhu Rajanna	0da4bd5151	start controller or node server based on config if both controller and nodeserver flags are set/unset cephcsi will start both server, if only one flag is set, it will start relavent service. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 06:11:43 +00:00
Madhu Rajanna	89732d923f	move flag configuration variable to util remove unwanted checks remove getting drivertype from binary name Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 06:11:43 +00:00
Daniel-Pivonka	d621a58207	prometheus liveness probe sidecar Signed-off-by: Daniel-Pivonka dpivonka@redhat.com	2019-08-13 17:51:41 +00:00
Niels de Vos	31648c8feb	provisioners: add reconfiguring of PID limit The container runtime CRI-O limits the number of PIDs to 1024 by default. When many PVCs are requested at the same time, it is possible for the provisioner to start too many threads (or go routines) and executing 'rbd' commands can start to fail. In case a go routine can not get started, the process panics. The PID limit can be changed by passing an argument to kubelet, but this will affect all pids running on a host. Changing the parameters to kubelet is also not a very elegant solution. Instead, the provisioner pod can change the configuration itself. The pod is running in privileged mode and can write to /sys/fs/cgroup where the limit is configured. With this change, the limit is configured to 'max', just as if there is no limit at all. The logs of the csi-rbdplugin in the provisioner pod will reflect the change it makes when starting the service: $ oc -n rook-ceph logs -c csi-rbdplugin csi-rbdplugin-provisioner-0 .. I0726 13:59:19.737678 1 cephcsi.go:127] Initial PID limit is set to 1024 I0726 13:59:19.737746 1 cephcsi.go:136] Reconfigured PID limit to -1 (max) .. It is possible to pass a different limit on the commandline of the cephcsi executable. The following flag has been added: --pidlimit=<int> the PID limit to configure through cgroups This accepts special values -1 (max) and 0 (default, do not reconfigure). Other integers will be the limit that gets configured in cgroups. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2019-08-13 14:43:29 +00:00
Humble Chirammal	0786225937	Implement metrics for RBD plugin Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-08-01 11:58:54 +00:00
Madhu Rajanna	778cfb3090	provide option to set pluginpath for cephfs Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-25 14:47:42 +05:30
Madhu Rajanna	3f8bd3b2a6	Update driver version during build time update driver version and add git commit to the image. This will help us to identify what latest git commit image contains. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-12 15:54:52 +05:30
ShyamsundarR	fa68c35f3b	Support mounting and deleting version 1.0.0 RBD volumes This commit adds support to mount and delete volumes provisioned by older plugin versions (1.0.0) in order to support backward compatibility to 1.0.0 created volumes. It adds back the ability to specify where older meta data was specified, using the metadatastorage option to the plugin. Further, using the provided meta data to mount and delete the older volumes. It also supports a variety of ways in which monitor information may have been specified (in the storage class, or in the secret), to keep the monitor information current. Testing done: - Mount/Delete 1.0.0 plugin created volume with monitors in the StorageClass - Mount/Delete 1.0.0 plugin created volume with monitors in the secret with a key "monitors" - Mount/Delete 1.0.0 plugin created volume with monitors in the secret with a user specified key - PVC creation and deletion with the current version (to ensure at the minimum no broken functionality) - Tested some negative cases, where monitor information is missing in secrets or present with a different key name, to understand if failure scenarios work as expected Updates #378 Follow-up work: - Documentation on how to upgrade to 1.1 plugin and retain above functionality for older volumes Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-08 15:40:17 +00:00
Madhu Rajanna	a38986fce0	Enable all static-checks in golangci-lint * Enable all static-checks in golangci-lint * Update golangci-lint version * Fix issue found in golangci-lint Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-10 15:56:17 +05:30
ShyamsundarR	b9cd0e18ad	Make CephFS plugin stateless reusing RADOS based journal scheme This is a part of the stateless set of commits for CephCSI. This commit removes the dependency on config maps to store cephFS provisioned volumes, and instead relies on RADOS based objects and keys, and required CSI VolumeID encoding to detect the provisioned volumes. Changes: - Provide backward compatibility to provisioned volumes by older plugin versions (1.0.0 or older) - Remove Create/Delete support for statically provisioned volumes (fixes #382) - Added namespace support to RADOS OMaps and used the same to store RADOS CSI objects and keys in the CephFS metadata pool - Added support to mention fsname for CephFS provisioning (fixes #359) - Changed field name in CSI Identifier to 'location', to denote a pool or fscid - Updated mounter cache to use new scheme - Required Helm manifests are updated - Required documentation and other manifests are updated - Made driver option 'metadatastorage' as optional, as fresh installs do not need to specify the same Testing done: - Create/Mount/Delete PVC - Create/Delete 5 PVCs - Mount version 1.0.0 PVC - Delete version 1.0.0 PV - Mount Statically defined PV/PVC/Pod - Mount Statically defined version 1.0.0 PV/PVC/Pod - Delete Statically defined version 1.0.0 PV/PVC/Pod - Node restart when mounted to test mountcache - Use InstanceID other than 'default' - RBD basic round of tests, as namespace is added to OMaps - csitest against ceph-fs plugin - NOTE: CephFS plugin still does not detect and address already created volumes but of a different size - Test not providing any value to the metadata storage parameter Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-05-30 06:20:35 -04:00

1 2

73 Commits