ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2024-12-18 11:00:25 +00:00

Author	SHA1	Message	Date
Prasanna Kumar Kalever	b6a88dd728	rbd: add volume healer Problem: ------- For rbd nbd userspace mounter backends, after a restart of the nodeplugin all the mounts will start seeing IO errors. This is because, for rbd-nbd backends there will be a userspace mount daemon running per volume, post restart of the nodeplugin pod, there is no way to restore the daemons back to life. Solution: -------- The volume healer is a one-time activity that is triggered at the startup time of the rbd nodeplugin. It navigates through the list of volume attachments on the node and acts accordingly. For now, it is limited to nbd type storage only, but it is flexible and can be extended in the future for other backend types as needed. From a few feets above: This solves a severe problem for nbd backed csi volumes. The healer while going through the list of volume attachments on the node, if finds the volume is in attached state and is of type nbd, then it will attempt to fix the rbd-nbd volumes by sending a NodeStageVolume request with the required volume attributes like secrets, device name, image attributes, and etc.. which will finally help start the required rbd-nbd daemons in the nodeplugin csi-rbdplugin container. This will allow reattaching the backend images with the right nbd device, thus allowing the applications to perform IO without any interruptions even after a nodeplugin restart. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	6007fc9bfe	cleanup: move static volume check to helper function Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	6d24080851	rbd: update per volume metadata stash-file with devicePath As part of stage transaction if the mounter is of type nbd, then capture device path after a successful rbd-nbd map. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Humble Chirammal	61bf49a4f5	rbd: Get rid of locking at nodePublish Considering kubelet make sure the stage and publish operations are serialized, we dont need any extra locking in nodePublish Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-16 07:18:56 +00:00
Humble Chirammal	ef852cc93d	rbd: Get rid of locking at nodeUnpublish call Considering kubelet make sure the unstage and unpublish operations are serialized, we dont need any extra locking in nodeUnpublish Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-16 07:18:56 +00:00
Yati Padia	f36d611ef9	cleanup: resolves gofumpt issues of internal codes This PR runs gofumpt for internal folder. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-14 19:50:56 +00:00
Yati Padia	4a649fe17f	cleanup: resolve godot linter This commit resolves godot linter issue which says "Comment should end in a period (godot)". Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-13 06:50:03 +00:00
Yati Padia	ffab37f44f	cleanup: resolves gocritic linter issues This commit resolves gocritic linter errors. Updates: #2250 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-08 05:19:26 +00:00
Rakshith R	1b23d78113	rebase: update kubernetes to v1.21.2 Updated kubernetes packages to latest release. resizefs package has been included into k8s.io/mount-utils package. updated code to use the same. Updates: #1968 Signed-off-by: Rakshith R <rar@redhat.com>	2021-07-01 03:35:23 +00:00
Humble Chirammal	e829308249	internal: reformat long lines in internal/rbd package to 120 chars We have many declarations and invocations..etc with long lines which are very difficult to follow while doing code reading. This address the issues in 'internal/rbd/*server.go' and 'internal/rbd/driver.go' files to restrict the line length to 120 chars. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-06-28 14:43:49 +00:00
Rakshith R	404e011ae9	cleanup: added helper func isNotMountPoint Added helper func isNotMountPoint to check mountPoint, validate error and reduce complexity of NodeStageVolume. Signed-off-by: Rakshith R <rar@redhat.com>	2021-06-28 05:46:42 +00:00
Rakshith R	84b046d736	rbd: add check for imageFeatures parameter This commit adds checks for missing `imageFeatures` parameter in createvolumerequest and nodestagerequest(only for static PVs). Missing `imageFeatures` parameter is ignored in case of non-static PVs to ensure backwards compatibility with older versions which did not have `imageFeatures` as required parameter. Signed-off-by: Rakshith R <rar@redhat.com>	2021-06-28 05:46:42 +00:00
Niels de Vos	4908ff8743	rbd: no need to flatten thick-provisioned images Thick-provisioned images are independent, cloned images or snapshots are deep-flattened during creation. There is no need to try and flatten them again. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-18 06:25:56 +00:00
Yati Padia	6bfdf2feb0	cleanup: gocyclo being unused for linter This commit addresses the following issue: 'nolint:gocyclo // complexity needs to be reduced.' is unused for linter "gocyclo" (nolintlint) Updates:#2025 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-06-15 02:54:16 +00:00
Niels de Vos	25d0a1cfc0	rbd: add support for block-devices in NodeGetVolumeStats() The NodeGetVolumeStats procedure can now be used to fetch the capacity of the RBD block-device. By default this is a thin-provisioned device, which means that the capacity is not reserved in the Ceph cluster. This makes it possible to over-provision the cluster. In order to detect the amount of storage used by the RBD block-device (when thin-provisioned), it is required to connect to the Ceph cluster. Unfortunately, the NodeGetVolumeStats CSI procedure does not provide enough parameters to connect to the Ceph cluster and fetch more details about the RBD image. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-05-25 06:41:04 +00:00
Madhu Rajanna	67d73cd6e9	rbd: flatten image if the depth is not zero flatten the image if the deep-flatten feature is present on the images in the chain or if the images in chain is not zero, as we cannot check the deep-flatten feature the images which are in trash. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-05-07 07:57:37 +00:00
Niels de Vos	f11a041f56	cleanup: address gosec complaint about creating a file The new gosec 2.7.0 complains like: G304 (CWE-22): Potential file inclusion via variable (Confidence: HIGH, Severity: MEDIUM) Updates: #2025 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-05-05 16:05:23 +00:00
Rakshith R	020cded581	cleanup: refactor deeply nested if statements in internal/rbd Refactored deeply nested if statement in internal/rbd to reduce cognitive complexity. Signed-off-by: Rakshith R <rar@redhat.com>	2021-04-07 02:31:41 +00:00
Niels de Vos	aaeb35eceb	rbd: encrypted volumes can be of type "crypto_LUKS" too It seems that newer versions of some tools/libraries identify encrypted filesystems with `crypto_LUKS` instead of `crypt`. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-06 15:54:27 +00:00
Niels de Vos	d4076d6216	util: introduce VolumeEncryption type Prepare for grouping encryption related functions together. The main rbdVolume object should not be cluttered with KMS or DEK procedures. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-12 10:11:47 +00:00
Madhu Rajanna	cbb10fd84d	rbd: add more logging for NodeUnstageVolume For NodeUnstageVolume its a two step process, first unmount the volume and than unmap the volume. Currently, we are logging only after rbd unmapping is done. sometimes it becomes difficult to debug with above logging whether more time is spent in unmount or unmap. This commits adds one more debug log after unmount is done. with this we can identify where exactly more time is spent by looking at the logs. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-11 17:40:57 +00:00
Niels de Vos	cf6dae86e9	rbd: move encryptDevice() to a method of rbdVolume Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-24 13:16:11 +00:00
Niels de Vos	fb065b0f39	rbd: move openEncryptedDevice() to a method of rbdVolume Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-24 13:16:11 +00:00
Niels de Vos	4937e59c4d	rbd: add backwards compatible encryption in NodeStageVolume When a volume was provisioned by an old Ceph-CSI provisioner, the metadata of the RBD image will contain `requiresEncryption` to indicate a passphrase needs to be created. New Ceph-CSI provisioners create the passphrase in the CreateVolume request, and set `encryptionPrepared` instead. When a new node-plugin detects that `requiresEncryption` is set in the RBD image metadata, it will fallback to the old behaviour. In case `encryptionPrepared` is read from the RBD image metadata, the passphrase is used to cryptsetup/format the image. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-17 17:51:13 +00:00
Niels de Vos	9b6c2117f3	rbd: set encryption passphrase on CreateVolume Have the provisioner create the passphrase for the volume, instead of doign it lazily at the time the volume is used for the 1st time. This prevents potential races where pods on different nodes try to store different passphrases at the (almost) same time. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-17 17:51:13 +00:00
Madhu Rajanna	9c7176dbb4	rbd: update mount packges in import path mount packges is moved from k8s.io/utils/mount to a new repository k8s.io/mount-utils. updated code to use the same. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-17 16:04:54 +00:00
Niels de Vos	4dde3fc9e0	cleanup: return error type in encryptDevice() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	d6fb8f302d	cleanup: return error type in NodeServer.processEncryptedDevice() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	8019e4d1bc	rbd: return CSI status-error on resize failure Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	65a10fd553	cleanup: standardize error format in NodeServer.NodeStageVolume() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Humble Chirammal	70358c8eb7	rbd: volJournal.Connect() return wrongly pushed to caller volJournal.Connect() got the error on err2 variable, however the return was on variable err which hold the error return of DecomposeCSIID() which is wrong. This cause the error return wrongly parsed and pushed from the caller. From now on, we are reusing the err variable to hold and revert the error of volJournal.Connect(). Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-20 12:45:51 +00:00
Madhu Rajanna	d1f175d9f3	rbd: add support for rbd map and unmap options added support for providing map and unmap options to rbd CLI when mapping rbd image on the node. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-09-21 13:27:28 +00:00
Mudit Agarwal	4de1abad5e	rbd: NodeExpandVolume() should use StagingTargetPath Form kubernetes v1.19 onwards NodeRequest is getting volume path in StagingTargetPath instead of VolumePath, cephcsi should also use the same. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-08-25 15:58:44 +00:00
Madhu Rajanna	e768c0dfc0	rbd: replace klog with util logger in nodeserver.go replace klog with util logger in nodeserver.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Humble Chirammal	1f5b84745f	rbd: Bail out from nodeexpansion if its block mode pvc At CSI spec < 1.2.0, there was no volumecapability in the expand request. However its available from v1.2+ which allows us to declare the node operations based on the volume mode. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-08-19 12:34:20 +00:00
Mehdy Khoshnoody	fc5eadf106	rbd: Add rados namespace support for rbd Make sure to operate within the namespace if any given when dealing with rbd images and snapshots and their journals. Signed-off-by: Mehdy Khoshnoody <mehdy.khoshnoody@gmail.com>	2020-08-12 16:22:58 +05:30
Niels de Vos	47d5b60af8	rbd: disable reflink while creating XFS filesystems Current versions of the mkfs.xfs binary enable reflink support by default. This causes problems on systems where the kernel does not support this feature. When the kernel the feature does not support, but the filesystem has it enabled, the following error is logged in `dmesg`: XFS: Superblock has unknown read-only compatible features (0x4) enabled Introduce a check to see if mkfs.xfs supports the `-m reflink=` option. In case it does, pass `-m reflink=0` while creating the filesystem. The check is executed once during the first XFS filesystem creation. The result of the check is cached until the nodeserver restarts. Fixes: #966 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 13:37:51 +00:00
Sven Anderson	92884f56f4	rbd: simplify error handling This change replaces the sentinel errors in rbd module with standard errors created with errors.New(). Related: #1203 Signed-off-by: Sven Anderson <sven@redhat.com>	2020-07-23 11:16:40 +00:00
Yug	71ddf51544	cleanup: address gomnd warnings Direct usage of numbers should be avoided. Issue reported: mnd: Magic number: X, in <argument> detected (gomnd) Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Yug	48fa43270f	cleanup: address gocritic warnings Add explanation to nolint directives. Issue reported: whyNoLint: include an explanation for nolint directive (gocritic) Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Yug	7f94a57908	cleanup: address godot warnings Top level comments should end in a period Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Madhu Rajanna	d15ded88f5	cleanup: Remove support for Delete and Unmounting v1.1.0 PVC as v1.0.0 is deprecated we need to remove the support for it in the Next coming (v3.0.0) release. This PR removes the support for the same. closes #882 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-10 16:07:13 +00:00
Madhu Rajanna	a0fd805a8b	rbd: Add support for smart cloning Added support for RBD PVC to PVC cloning, below commands are executed to create a PVC-PVC clone from RBD side. * Check the depth(n) of the cloned image if n>=(hard limit -2) or ((soft limit-2) Add a task to flatten the image and return about (to avoid image leak) Note will try to flatten the temp clone image in the chain if available * Reserve the key and values in omap (this will help us to avoid the leak as it's not reserved earlier as we have returned ABORT (the request may not come back)) * Create a snapshot of rbd image * Clone the snapshot (temp clone) * Delete the snapshot * Snapshot the temp clone * Clone the snapshot (final clone) * Delete the snapshot ```bash 1) check the image depth of the parent image if flatten required add a task to flatten image and return ABORT to avoid leak (hardlimit-2 and softlimit-2 check will be done) 2) Reserve omap keys 2) rbd snap create <RBD image for src k8s volume>@<random snap name> 3) rbd clone --rbd-default-clone-format 2 --image-feature layering,deep-flatten <RBD image for src k8s volume>@<random snap> <RBD image for temporary snap image> 4) rbd snap rm <RBD image for src k8s volume>@<random snap name> 5) rbd snap create <cloned RBD image created in snapshot process>@<random snap name> 6) rbd clone --rbd-default-clone-format 2 --image-feature <k8s dst vol config> <RBD image for temporary snap image>@<random snap name> <RBD image for k8s dst vol> 7)rbd snap rm <RBD image for src k8s volume>@<random snap name> ``` * Delete temporary clone image created as part of clone(delete if present) * Delete rbd image Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-10 14:02:12 +00:00
Yug	1490daed7e	cleanup: Avoid usage of numbers Add seperate functions to handle all levels and types of logging. Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-10 07:41:23 +00:00
Yug	8dc4ab6b1b	rebase: update k8s.io/klog to v2.3.0 Update klog version to v2.3.0 Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-10 07:41:23 +00:00
Madhu Rajanna	8f758450d8	rbd: add RHEL 8.2 kernel to the list as RHEL 8.2 supports the deep-flatten feature, added it to the list to map the rbd images on the node without flattening. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-06 12:20:00 +00:00
Madhu Rajanna	04c8c7fd4a	rbd: correct upstream kernel version for deep-flatten as v5.1.0 supports the deep-flatten feature,lowering the required version to map rbd images which are having deep-flatten feature Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-06 12:20:00 +00:00
Niels de Vos	d4dad7c189	cleanup: use errors.As() in rbd.NodeUnstageVolume() See-also: https://github.com/golang/go/wiki/ErrorValueFAQ#how-should-i-change-my-error-handling-code-to-work-with-the-new-features Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-03 09:12:48 +00:00
Madhu Rajanna	b085577a4f	rbd: add skipForceFlatten flag added skipForceFlatten flag to skip the image deptha and skip image flattening. This will be very useful if the kernel is not listed in cephcsi which supports deep flatten fauture. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-01 08:21:47 +00:00
Madhu Rajanna	649aeb7aaf	rbd: Add support for rbd ROX PVC mounting if the PVC access mode is ReadOnlyMany or single node readonly, mounting the rbd device path to the staging path as readonly to avoid the write operation. If the PVC acccess mode is readonly, mapping rbd images as readonly. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-06-22 06:15:40 +00:00

1 2

58 Commits