ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-05-22 07:16:41 +00:00

Author	SHA1	Message	Date
Madhu Rajanna	0e67c8da24	rbd: fix checkHealthyPrimary to consider up+stopped state we need to check for image should be in up+stopped state not anyone of the state for that the we need to use OR check not the AND check. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 53e76fab692750bb74e630ab25bb7052b366b420)	2022-06-28 15:39:59 +00:00
Madhu Rajanna	4b7f0a0541	revert: rbd: consider remote image health for primary When the image is force promoted to primary on the cluster the remote image might not be in replaying state because due to the split brain state. This PR reverts back the commit c3c87f2ef33e8d8ad08d7d9f28b59d1aedc4ef31. Which we added to check the remote image status. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 704cb5c941f7850800286e955c2db1de409ca35b)	2022-06-28 15:39:59 +00:00
Prasanna Kumar Kalever	d18eaceab4	rbd: healer detect Kubernetes version for right StagingTargetPath Kubernetes 1.24 and newer use a different path for staging the volume. That means the CSI-driver is requested to mount the volume at an other location, compared to previous versions of Kubernetes. CSI-drivers implementing the volumeHealer, must receive the correct path, otherwise the after a nodeplugin restart the NBD mounts will bailout attempting to NodeStageVolume() call and return an error. See-also: kubernetes/kubernetes#107065 Fixes: #3176 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com> (cherry picked from commit 1da446d2f2496e3e44c94cdfeb531b94c44dee97)	2022-06-24 16:16:01 +00:00
Madhu Rajanna	471c1342b1	rbd: issue resync only if the force flag is set During failover we do demote the volume on the primary as the image is still not promoted yet on the remote cluster, there are spurious split-brain errors reported by RBD, the Cephcsi resync will attempt to resync from the "known" secondary and that will cause data loss Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 3acaa018dbb971df40b29a848eba1ca0c0420299)	2022-06-24 08:16:39 +00:00
Madhu Rajanna	d7c7f29d77	rbd: create token and use it for vault SA create the token if kubernetes version in 1.24+ and use it for vault sa. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Rakshith R <rar@redhat.com> (cherry picked from commit 7a2dd4c3cf5891fc3d7627843b124dcdf4f8abf9)	2022-06-17 14:10:18 +00:00
Madhu Rajanna	417f9f2030	cephfs: skip NetNamespaceFilePath if the volume is pre-provisioned In case of pre-provisioned volume the clusterID is not set in the volume context as the clusterID is missing we cannot extract the NetNamespaceFilePath from the configuration file. For static volume and dynamically provisioned volume the clusterID is set. Note:- This is a special case to support mounting PV without clusterID parameter. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit c9943320ace5792f8549e85fb6a6121ba68aaf70)	2022-06-03 12:47:27 +00:00
Rakshith R	9be88e5af5	rbd: use vaultAuthPath variable name in error msg Before the change, the error msg was the following: ``` failed to set VAULT_AUTH_MOUNT_PATH in Vault config: path is empty ``` `vaultAuthPath` is the actual variable name set by the user. The error message will now be the following: ``` failed to set "vaultAuthPath" in vault config: path is empty ``` Signed-off-by: Rakshith R <rar@redhat.com> (cherry picked from commit 7688306f87da143bf8c869c871fc6ef02a315baa)	2022-05-26 10:01:07 +00:00
Prasanna Kumar Kalever	9f5908d873	rbd: fix bug handling GetKrbdSupportedFeatures() continue running rbd driver when /sys/bus/rbd/supported_features file is missing, do not bailout. Fixes: #2678 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com> (cherry picked from commit 6470cf334307965ccc859a88df49cabd6351ab7c)	2022-05-18 14:11:18 +00:00
Prasanna Kumar Kalever	a67bf8928c	rbd: handle when krbdFeatures is zero krbdFeatures is set to zero when kernel version < 3.8, i.e. in case where /sys/bus/rbd/supported_features is absent and we are unable to prepare the krbd attributes based on kernel version. When krbdFeatures is set to zero fallback to NBD only when autofallback is turned ON. Fixes: #2678 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com> (cherry picked from commit 83cc1b0e5804b0bcd04bd5433891fa505fc51ab8)	2022-05-18 14:11:18 +00:00
Prasanna Kumar Kalever	6894b4b910	rbd: prepare krbd feature attrs if supported_features file is absent Upstream /sys/bus/rbd/supported_features is part of Linux kernel v4.11.0 Prepare the attributes and use them in case if /sys/bus/rbd/supported_features is missing. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com> (cherry picked from commit e53fd8715422dc436d61e1fae09c8370aaa650ce)	2022-05-18 14:11:18 +00:00
Madhu Rajanna	691714228e	rbd: consider rbd as default mounter if not set For the default mounter the mounter option will not be set in the storageclass and as it is not available in the storageclass same will not be set in the volume context, Because of this the mapOptions are getting discarded. If the mounter is not set assuming it's an rbd mounter. Note:- If the mounter is not set in the storageclass we can set it in the volume context explicitly, Doing this check-in node server to support backward existing volumes and the check is minimal we are not altering the volume context. fixes: #3076 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 70674565df3e593e5b9127d9700374e598638d5e)	2022-05-10 14:49:27 +00:00
Rakshith R	584c87ce34	rbd: support pvc-pvc clone with different sc & encryption This commit makes modification so as to allow pvc-pvc clone with different storageclass having different encryption configs. This commit also modifies `copyEncryptionConfig()` to include a `isEncrypted()` check within the function. Signed-off-by: Rakshith R <rar@redhat.com> (cherry picked from commit f1ccc4eced70322dd824f57d0e253fe71993996d)	2022-05-06 17:37:36 +00:00
Rakshith R	272182a588	rbd: use `vaultAuthPath` variable name in error msg Before the change, the error msg was the following: ``` failed to set VAULT_AUTH_MOUNT_PATH in Vault config: path is empty ``` `vaultAuthPath` is the actual variable name set by the user. The error message will now be the following: ``` failed to set "vaultAuthPath" in vault config: path is empty ``` Signed-off-by: Rakshith R <rar@redhat.com> (cherry picked from commit bd57feb26e19a9e6db1ecfa7a8a07521277329eb)	2022-05-05 06:54:32 +00:00
Niels de Vos	5c0f9d565e	nfs: delete the CephFS volume when the export is already removed In case the NFS-export has already been removed from the NFS-server, but the CSI Controller was restarted, a retry to remove the NFS-volume will fail with an error like: > GRPC error: ....: response status not empty: "Export does not exist" When this error is reported, assume the NFS-export was already removed from the NFS-server configuration, and continue with deleting the backend volume. Signed-off-by: Niels de Vos <ndevos@redhat.com> (cherry picked from commit 9d7faf850f73a80794d53aa253da9f48f084bd8a)	2022-05-05 03:27:30 +00:00
Madhu Rajanna	c83a281857	cephfs: add netNamespaceFilePath for CephFS as same host directory is not shared between the cephfs and the rbd plugin pod. we need to keep the netNamespaceFilePath separately for both cephfs and rbd. CephFS plugin will use this path to execute mount -t commands. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit d2bc9743f751d60c24db88a87f288ca48e945308)	2022-04-19 16:33:59 +00:00
Madhu Rajanna	a901997542	cleanup: use block comment for ClusterInfo example Adjusted the mix of tabs and the spaces and also used block comment for better readability. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit eb4bfb73268e6c210e9d112d24e6700db601e0bd)	2022-04-19 16:33:59 +00:00
Madhu Rajanna	f8a19c8cbb	rbd: move radosNamespace to RBD section As radosNamespace is more specific to RBD not the general ceph configuration. Now we introduced a new RBD section for RBD specific options, Moving the radosNamespace to RBD section and keeping the radosNamespace still under the global ceph level configration for backward compatibility. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit b4acbd08a5e5c6169dd52cc99be5ccfad06419d1)	2022-04-19 16:33:59 +00:00
Madhu Rajanna	76398d6887	util: Add RBD specific options in clusterInfo As the netNamespaceFilePath can be separate for both cephfs and rbd adding the netNamespaceFilePath path for RBD, This will help us to keep RBD and CephFS specific options separately. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 766346868e61db4639d80494cc3f2fb2ed2fc6c2)	2022-04-19 16:33:59 +00:00
Niels de Vos	61ca06148e	nfs: return gRPC status from CephFS CreateVolume failure The NFS Controller returns a non-gRPC error in case the CreateVolume call for the CephFS volume fails. It is better to return the gRPC-error that the CephFS Controller passed along. Signed-off-by: Niels de Vos <ndevos@redhat.com> (cherry picked from commit 2b71aac752614182fb0ba5dd956307401d2920b1)	2022-04-19 10:41:27 +00:00
Niels de Vos	3ce0e1fa50	nfs: use go-ceph API for creating/deleting exports Recent versions of Ceph allow calling the NFS-export management functions over the go-ceph API. This seems incompatible with older versions that have been tested with the `ceph nfs` commands that this commit replaces. Signed-off-by: Niels de Vos <ndevos@redhat.com> (cherry picked from commit 28369702d275381b559f77c7e38adf599128e457)	2022-04-15 13:13:31 +00:00
Madhu Rajanna	e61012da14	rbd: use leases for leader election use leases for leader election instead of the deprecated configmap based leader election. This PR is making leases as default leader election refer https://github.com/kubernetes-sigs/ controller-runtime/pull/1773, default from configmap to configmap leases was done with https://github.com/kubernetes-sigs/ controller-runtime/pull/1144. Release notes https://github.com/kubernetes-sigs/ controller-runtime/releases/tag/v0.7.0 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit d886ab0d6634bca5a5b055bb7783138a2e3e5ded)	2022-04-15 10:24:19 +00:00
Madhu Rajanna	ebf2677b30	util: fix logging in ExecuteCommandWithNSEnter log the nsenter and its argument after executing the command with the nsenter CLI. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit c245436ec4f89e6801a21782da64aaa0d370070b)	2022-04-14 16:33:49 +00:00
Madhu Rajanna	3521465e60	rbd: check nbd tool features only for rbd driver calling setRbdNbdToolFeatures inside an init gets called in main.go for both cephfs and rbd driver. instead of calling it in init function calling this in rbd driver.go as this is specific to rbd. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit dffb6e72c2ab48d7b90197e405ae0d3d76a5fbc0)	2022-04-14 09:17:45 +00:00
Madhu Rajanna	db1b1dd6ec	rbd: consider remote image health for primary To consider the image is healthy during the Promote operation currently we are checking only the image state on the primary site. If the network is flaky or the remote site is down the image health is not as expected. To make sure the image is healthy across the clusters check the state on both local and the remote clusters. some details: https://bugzilla.redhat.com/show_bug.cgi?id=2014495 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 64a9b1fa5906d65478cdb5fb244b133bc1b1cfbe)	2022-04-13 10:57:40 +00:00
Madhu Rajanna	3161a6b060	util: add support for the nsenter add support to run rbd map and mount -t commands with the nsenter. complete design of pod/multus network is added here https://github.com/rook/rook/ blob/master/design/ceph/multus-network.md#csi-pods Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit 7b2aef0d8120033f232e81204616f1d09df708cd)	2022-04-08 14:44:20 +00:00
Prasanna Kumar Kalever	d760d0ab6d	rbd: check for cookie support from kernel Currently we only check if the rbd-nbd tool supports cookie feature. This change will also defend cookie addition based on kernel version Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-04 09:51:13 +00:00
Madhu Rajanna	f8bbd2f60f	cephfs: fix omap deletion in DeleteSnapshot The omap is stored with the requested snapshot name not with the subvolume snapshotname. This fix uses the correct snapshot request name to cleanup the omap once the subvolume snapshot is deleted. fixes: #2974 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-31 13:46:03 +00:00
Niels de Vos	1da19680b4	nfs: support new and old NFS-management commands The `ceph nfs export ...` commands have changed in recent Ceph releases. Use the most recent command as a default, fall back to the older command when an error is reported. This shoud make the NFS-provisioner work on any current Ceph version. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-31 11:28:40 +00:00
Madhu Rajanna	f90408be4d	rbd: increase force promote timeout to 2 minutes Increase the timeout to 2 minutes to give enough time for rollback to complete. As rollback is performed by the force-promote command it, at times, may take more than a minute (based on dirty blocks that need to be rolled back approximately) to rollback. The added extra 1 minute is useful though to avoid multiple calls to complete the rollback and in extremely corner cases to avoid failures in the first instance of the call when the mirror watcher is not yet removed (post scaling down the RBD mirror instance) Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-30 13:46:27 +00:00
Thibaut Blanchard	e874c9c11b	rbd: fix topology snapshot pool Restoring a snapshot with a new PVC results with a wrong dataPoolName in case of initial volume linked to a storageClass with topology constraints and erasure coding. Signed-off-by: Thibaut Blanchard <thibaut.blanchard@gmail.com>	2022-03-30 04:40:30 +00:00
Niels de Vos	885295fcc9	nfs: store the NFS-cluster name in the journal Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Niels de Vos	3b4d193ca8	journal: add StoreAttribute/FetchAttribute Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Niels de Vos	010fd816dd	nfs: store the calling Context in NFSVolume NFSVolume instances are short lived, they only extist for a certain gRPC procedure. It is easier to store the calling Context in the NFSVolume struct, than to pass it to some of the functions that require it. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Niels de Vos	6d83df9cc9	nfs: add basic provisioner with create/delete procedures These NFS Controller and Identity servers are the base for the new provisioner. The functionality is currently extremely limited, follow-up PRs will implement various CSI procedures. CreateVolume is implemented with the bare minimum. This makes it possible to create a volume, and mount it with the kubernetes-csi/csi-driver-nfs NodePlugin. DeleteVolume unexports the volume from the Ceph managed NFS-Ganesha service. In case the Ceph cluster provides multiple NFS-Ganesha deployments, things might not work as expected. This is going to be addressed in follow-up improvements. Lots of TODO comments need to be resolved before this can be declared "production ready". Unit- and e2e-tests are missing as well. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Robert Vasek	f6ae612003	util: added reference tracker RT, reference tracker, is key-based implementation of a reference counter. Unlike an integer-based counter, RT counts references by tracking unique keys. This allows accounting in situations where idempotency must be preserved. It guarantees there will be no duplicit increments or decrements of the counter. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-27 19:24:26 +00:00
Rakshith R	40de75e0db	rbd: modify oidc token file path according to FHS 3.0 OIDC token file path has been modified from `/var/run/secrets/token` to `/run/secrets/tokens`. This has been done to ensure compliance with FHS 3.0. refer: https://refspecs.linuxfoundation.org/FHS_3.0/fhs/ch05s13.html Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-23 13:29:35 +00:00
Madhu Rajanna	8c5e414d53	rbd: do not read pvc namespace from volume attributes Below are the 3 different cases where we need the PVC namespace for encryption * CreateVolume:- Read the namespace from the createVolume parameters and store it in the omap * NodeStage:- Read the namespace from the omap not from the volumeContext * Regenerate:- Read the pvc namespace from the claimRef not from the volumeAttributes. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	77011fbc61	cephfs: remove kubernetes csi prefixed parameters remove kubernetes csi prefixed parameters from the volumeContext as we dont want to store it in the PV VolumeAttributes. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	a7315a04c1	rbd: remove kubernetes csi prefixed parameters remove kubernetes csi prefixed parameters from the volumeContext as we dont want to store it in the PV VolumeAttributes. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	366c2ace31	util: add helper to get pvcnamespace from input added helper function to return the pvc namespace name from the input parameters. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	772fe8d6c8	util: add helper function to strip kube parameters added helper function to strip the kubernetes specific parameters from the volumeContext as volumeContext is storaged in the PV volumeAttributes Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Rakshith R	a56f9a0c05	rbd: flatten datasource image before creating volume This commit ensures that parent image is flattened before creating volume. - If the data source is a PVC, the underlying image's parent is flattened(which would be a temp clone or snapshot). hard & soft limit is reduced by 2 to account for depth that will be added by temp & final clone. - If the data source is a Snapshot, the underlying image is itself flattened. hard & soft limit is reduced by 1 to account for depth that will be added by the clone which will be restored from the snapshot. Flattening step for resulting PVC image restored from snapshot is removed. Flattening step for temp clone & final image is removed when pvc clone is being created. Fixes: #2190 Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-18 10:27:27 +00:00
Madhu Rajanna	d357bebbc2	cephfs: disallow creating small volumes from snapshot/volume as per the CSI standard the size is optional parameter, as we are allowing the clone to a bigger size today we need to block the clone to a smaller size as its a have side effects like data corruption etc. Note:- Even though this check is present in kubernetes sidecar as CSI is CO independent adding the check here. fixes: #2718 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-17 05:07:26 +00:00
Humble Chirammal	525ff5d97f	rbd: remove unimplemented responses for node operations These RPCs( nodestage,unstage,volumestats) are implemented RPCs for our drivers atm. This commit removes the `unimplemented` responses from the common/default server initialization routins. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-03-16 15:27:48 +00:00
Humble Chirammal	66e7f3525f	cleanup: remove unimplemented controller expand,snapshot RPCs These RPCs ( controller expand, create and delete snapshots) are no longer unimplmented and we dont have to declare these as with `unimplemented` states. This commit remove the same. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-03-16 15:27:48 +00:00
Rakshith R	4f0bb2315b	rbd: add `aws-sts-metdata` encryption type With Amazon STS and kubernetes cluster is configured with OIDC identity provider, credentials to access Amazon KMS can be fetched using oidc-token(serviceaccount token). Each tenant/namespace needs to create a secret with aws region, role and CMK ARN. Ceph-CSI will assume the given role with oidc token and access aws KMS, with given CMK to encrypt/decrypt DEK which will stored in the image metdata. Refer: https://docs.aws.amazon.com/STS/latest/APIReference/welcome.html Resolves: #2879 Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-16 07:29:56 +00:00
Prasanna Kumar Kalever	3eb0fa5e21	rbd: fix parsing mapOptions Currently, we support mapOption: "krbd:v1,v2,v3;nbd:v1,v2,v3" - By omitting `krbd:` or `nbd:`, the option(s) apply to rbdDefaultMounter which is krbd. - A user can _override_ the options for a mounter by specifying `krbd:` or `nbd:`. mapOption: "v1,v2,v3;nbd:v1,v2,v3" is effectively the same as the 1st example. - Sections are split by `;`. - If users want to specify common options for both `krbd` and `nbd`, they should mention them twice. But in case if the krbd or nbd specifc options contian `:` within them, then the parsing is failing now. E0301 10:19:13.615111 7348 utils.go:200] ID: 63 Req-ID: 0001-0009-rook-ceph-0000000000000001-fd37c41b-9948-11ec-ad32-0242ac110004 GRPC error: badly formatted map/unmap options: "krbd:read_from_replica=localize,crush_location=zone:zone1;" This patch fix the above case where the options itself contain `:` delimitor ex: krbd:v1,v2,v3=v31:v32;nbd:v1,v2,v3" Please note, if you are using such options which contain `:` delimiter, then it is mandatory to specify the mounter-type. Fixes: #2910 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-03-14 15:21:25 +00:00
Madhu Rajanna	78ec859dc6	cleanup: remove unwanted print Removing unwanted print from the code Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-11 05:40:32 +00:00
Robert Vasek	80dda7cc30	cephfs: detect corrupt ceph-fuse mounts and try to remount Mounts managed by ceph-fuse may get corrupted by e.g. the ceph-fuse process exiting abruptly, or its parent container being terminated, taking down its child processes with it. This commit adds checks to NodeStageVolume and NodePublishVolume procedures to detect whether a mountpoint in staging_target_path and/or target_path is corrupted, and remount is performed if corruption is detected. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-10 06:05:52 +00:00
Robert Vasek	aa6297e164	cleanup: refactor helper functions in nodeserver.go Refactored a couple of helper functions for easier resue. * Code for building store.VolumeOptions is factored out into a separate function. * Changed args of getCredentailsForVolume() and NodeServer.mount() so that instead of passing in whole csi.NodeStageVolumeRequest, only necessary properties are passed explicitly. This is to allow these functions to be called outside of NodeStageVolume() where NodeStageVolumeRequest is not available. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-10 06:05:52 +00:00

1 2 3 4 5 ...

882 Commits