ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-06-03 04:16:42 +00:00

Author	SHA1	Message	Date
Madhu Rajanna	64a9b1fa59	rbd: consider remote image health for primary To consider the image is healthy during the Promote operation currently we are checking only the image state on the primary site. If the network is flaky or the remote site is down the image health is not as expected. To make sure the image is healthy across the clusters check the state on both local and the remote clusters. some details: https://bugzilla.redhat.com/show_bug.cgi?id=2014495 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-13 08:37:23 +00:00
Madhu Rajanna	dffb6e72c2	rbd: check nbd tool features only for rbd driver calling setRbdNbdToolFeatures inside an init gets called in main.go for both cephfs and rbd driver. instead of calling it in init function calling this in rbd driver.go as this is specific to rbd. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-11 21:18:27 +00:00
Humble Chirammal	959df4dbac	doc: correct typos in struct field comments and release.md corrected strings in the release guide and util server. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-04-11 06:23:25 +00:00
Prasanna Kumar Kalever	41fe2c7dda	rbd: set metadata on the snapshot Set snapshot-name/snapshot-namespace/snapshotcontent-name details on RBD backend snapshot image as metadata on snapshot Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Prasanna Kumar Kalever	0ef79c6fc0	rbd: set metadata on restart of provisioner pod Make sure to set metadata when image exist, i.e. if the provisioner pod is restarted while createVolume is in progress, say it created the image but didn't yet set the metadata. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Prasanna Kumar Kalever	ae5925f04c	rbd: update PV/PVC metadata on a reattach of PV Example if a PVC was delete by setting `persistentVolumeReclaimPolicy` as `Retain` on PV, and PV is reattached to a new PVC, we make sure to update PV/PVC image metadata on a PV reattach. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Prasanna Kumar Kalever	0119d69ab2	rbd: set PV/PVC details on the image as metadata on create This helps Monitoring solutions without access to Kubernetes clusters to display the details of the PV/PVC/NameSpace in their dashboard. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Prasanna Kumar Kalever	4d750ed0e5	rbd: add set/Get VolumeMetadata() utility function Define and use PV and PVC metadata keys used by external provisioner. The CSI external-provisioner (v1.6.0+) introduces the --extra-create-metadata flag, which automatically sets map<string, string> parameters in the CSI CreateVolumeRequest. Add utility functions to set/Get PV/PVC/PVCNamespace metadata on image Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Madhu Rajanna	7b2aef0d81	util: add support for the nsenter add support to run rbd map and mount -t commands with the nsenter. complete design of pod/multus network is added here https://github.com/rook/rook/ blob/master/design/ceph/multus-network.md#csi-pods Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-08 10:23:21 +00:00
Prasanna Kumar Kalever	d760d0ab6d	rbd: check for cookie support from kernel Currently we only check if the rbd-nbd tool supports cookie feature. This change will also defend cookie addition based on kernel version Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-04 09:51:13 +00:00
Madhu Rajanna	f8bbd2f60f	cephfs: fix omap deletion in DeleteSnapshot The omap is stored with the requested snapshot name not with the subvolume snapshotname. This fix uses the correct snapshot request name to cleanup the omap once the subvolume snapshot is deleted. fixes: #2974 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-31 13:46:03 +00:00
Niels de Vos	1da19680b4	nfs: support new and old NFS-management commands The `ceph nfs export ...` commands have changed in recent Ceph releases. Use the most recent command as a default, fall back to the older command when an error is reported. This shoud make the NFS-provisioner work on any current Ceph version. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-31 11:28:40 +00:00
Madhu Rajanna	f90408be4d	rbd: increase force promote timeout to 2 minutes Increase the timeout to 2 minutes to give enough time for rollback to complete. As rollback is performed by the force-promote command it, at times, may take more than a minute (based on dirty blocks that need to be rolled back approximately) to rollback. The added extra 1 minute is useful though to avoid multiple calls to complete the rollback and in extremely corner cases to avoid failures in the first instance of the call when the mirror watcher is not yet removed (post scaling down the RBD mirror instance) Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-30 13:46:27 +00:00
Thibaut Blanchard	e874c9c11b	rbd: fix topology snapshot pool Restoring a snapshot with a new PVC results with a wrong dataPoolName in case of initial volume linked to a storageClass with topology constraints and erasure coding. Signed-off-by: Thibaut Blanchard <thibaut.blanchard@gmail.com>	2022-03-30 04:40:30 +00:00
Niels de Vos	885295fcc9	nfs: store the NFS-cluster name in the journal Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Niels de Vos	3b4d193ca8	journal: add StoreAttribute/FetchAttribute Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Niels de Vos	010fd816dd	nfs: store the calling Context in NFSVolume NFSVolume instances are short lived, they only extist for a certain gRPC procedure. It is easier to store the calling Context in the NFSVolume struct, than to pass it to some of the functions that require it. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Niels de Vos	6d83df9cc9	nfs: add basic provisioner with create/delete procedures These NFS Controller and Identity servers are the base for the new provisioner. The functionality is currently extremely limited, follow-up PRs will implement various CSI procedures. CreateVolume is implemented with the bare minimum. This makes it possible to create a volume, and mount it with the kubernetes-csi/csi-driver-nfs NodePlugin. DeleteVolume unexports the volume from the Ceph managed NFS-Ganesha service. In case the Ceph cluster provides multiple NFS-Ganesha deployments, things might not work as expected. This is going to be addressed in follow-up improvements. Lots of TODO comments need to be resolved before this can be declared "production ready". Unit- and e2e-tests are missing as well. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-28 11:23:17 +00:00
Robert Vasek	f6ae612003	util: added reference tracker RT, reference tracker, is key-based implementation of a reference counter. Unlike an integer-based counter, RT counts references by tracking unique keys. This allows accounting in situations where idempotency must be preserved. It guarantees there will be no duplicit increments or decrements of the counter. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-27 19:24:26 +00:00
Rakshith R	40de75e0db	rbd: modify oidc token file path according to FHS 3.0 OIDC token file path has been modified from `/var/run/secrets/token` to `/run/secrets/tokens`. This has been done to ensure compliance with FHS 3.0. refer: https://refspecs.linuxfoundation.org/FHS_3.0/fhs/ch05s13.html Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-23 13:29:35 +00:00
Madhu Rajanna	8c5e414d53	rbd: do not read pvc namespace from volume attributes Below are the 3 different cases where we need the PVC namespace for encryption * CreateVolume:- Read the namespace from the createVolume parameters and store it in the omap * NodeStage:- Read the namespace from the omap not from the volumeContext * Regenerate:- Read the pvc namespace from the claimRef not from the volumeAttributes. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	77011fbc61	cephfs: remove kubernetes csi prefixed parameters remove kubernetes csi prefixed parameters from the volumeContext as we dont want to store it in the PV VolumeAttributes. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	a7315a04c1	rbd: remove kubernetes csi prefixed parameters remove kubernetes csi prefixed parameters from the volumeContext as we dont want to store it in the PV VolumeAttributes. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	366c2ace31	util: add helper to get pvcnamespace from input added helper function to return the pvc namespace name from the input parameters. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	772fe8d6c8	util: add helper function to strip kube parameters added helper function to strip the kubernetes specific parameters from the volumeContext as volumeContext is storaged in the PV volumeAttributes Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Rakshith R	a56f9a0c05	rbd: flatten datasource image before creating volume This commit ensures that parent image is flattened before creating volume. - If the data source is a PVC, the underlying image's parent is flattened(which would be a temp clone or snapshot). hard & soft limit is reduced by 2 to account for depth that will be added by temp & final clone. - If the data source is a Snapshot, the underlying image is itself flattened. hard & soft limit is reduced by 1 to account for depth that will be added by the clone which will be restored from the snapshot. Flattening step for resulting PVC image restored from snapshot is removed. Flattening step for temp clone & final image is removed when pvc clone is being created. Fixes: #2190 Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-18 10:27:27 +00:00
Madhu Rajanna	d357bebbc2	cephfs: disallow creating small volumes from snapshot/volume as per the CSI standard the size is optional parameter, as we are allowing the clone to a bigger size today we need to block the clone to a smaller size as its a have side effects like data corruption etc. Note:- Even though this check is present in kubernetes sidecar as CSI is CO independent adding the check here. fixes: #2718 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-17 05:07:26 +00:00
Humble Chirammal	525ff5d97f	rbd: remove unimplemented responses for node operations These RPCs( nodestage,unstage,volumestats) are implemented RPCs for our drivers atm. This commit removes the `unimplemented` responses from the common/default server initialization routins. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-03-16 15:27:48 +00:00
Humble Chirammal	66e7f3525f	cleanup: remove unimplemented controller expand,snapshot RPCs These RPCs ( controller expand, create and delete snapshots) are no longer unimplmented and we dont have to declare these as with `unimplemented` states. This commit remove the same. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-03-16 15:27:48 +00:00
Rakshith R	4f0bb2315b	rbd: add `aws-sts-metdata` encryption type With Amazon STS and kubernetes cluster is configured with OIDC identity provider, credentials to access Amazon KMS can be fetched using oidc-token(serviceaccount token). Each tenant/namespace needs to create a secret with aws region, role and CMK ARN. Ceph-CSI will assume the given role with oidc token and access aws KMS, with given CMK to encrypt/decrypt DEK which will stored in the image metdata. Refer: https://docs.aws.amazon.com/STS/latest/APIReference/welcome.html Resolves: #2879 Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-16 07:29:56 +00:00
Prasanna Kumar Kalever	3eb0fa5e21	rbd: fix parsing mapOptions Currently, we support mapOption: "krbd:v1,v2,v3;nbd:v1,v2,v3" - By omitting `krbd:` or `nbd:`, the option(s) apply to rbdDefaultMounter which is krbd. - A user can _override_ the options for a mounter by specifying `krbd:` or `nbd:`. mapOption: "v1,v2,v3;nbd:v1,v2,v3" is effectively the same as the 1st example. - Sections are split by `;`. - If users want to specify common options for both `krbd` and `nbd`, they should mention them twice. But in case if the krbd or nbd specifc options contian `:` within them, then the parsing is failing now. E0301 10:19:13.615111 7348 utils.go:200] ID: 63 Req-ID: 0001-0009-rook-ceph-0000000000000001-fd37c41b-9948-11ec-ad32-0242ac110004 GRPC error: badly formatted map/unmap options: "krbd:read_from_replica=localize,crush_location=zone:zone1;" This patch fix the above case where the options itself contain `:` delimitor ex: krbd:v1,v2,v3=v31:v32;nbd:v1,v2,v3" Please note, if you are using such options which contain `:` delimiter, then it is mandatory to specify the mounter-type. Fixes: #2910 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-03-14 15:21:25 +00:00
Madhu Rajanna	78ec859dc6	cleanup: remove unwanted print Removing unwanted print from the code Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-11 05:40:32 +00:00
Robert Vasek	80dda7cc30	cephfs: detect corrupt ceph-fuse mounts and try to remount Mounts managed by ceph-fuse may get corrupted by e.g. the ceph-fuse process exiting abruptly, or its parent container being terminated, taking down its child processes with it. This commit adds checks to NodeStageVolume and NodePublishVolume procedures to detect whether a mountpoint in staging_target_path and/or target_path is corrupted, and remount is performed if corruption is detected. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-10 06:05:52 +00:00
Robert Vasek	aa6297e164	cleanup: refactor helper functions in nodeserver.go Refactored a couple of helper functions for easier resue. * Code for building store.VolumeOptions is factored out into a separate function. * Changed args of getCredentailsForVolume() and NodeServer.mount() so that instead of passing in whole csi.NodeStageVolumeRequest, only necessary properties are passed explicitly. This is to allow these functions to be called outside of NodeStageVolume() where NodeStageVolumeRequest is not available. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-10 06:05:52 +00:00
Rakshith R	3a64ee48c3	rbd: return unimplemented error for block-mode reclaimspace req blkdiscard cmd discards all data on the block device which is not desired. Hence, return unimplemented code if the volume access mode is block. Signed-off-by: Rakshith R <rar@redhat.com>	2022-03-03 19:00:49 +00:00
Niels de Vos	1f012004a6	util: configure tenants vaultAuthNamespace if not set When a tenant provides a configuration that includes the `vaultNamespace` option, the `vaultAuthNamespace` option is still taken from the global configuration. This is not wanted in all cases, as the `vaultAuthNamespace` option defauls to the `vaultNamespace` option which the tenant may want to override as well. The following behaviour is now better defined: 1. no `vaultAuthNamespace` in the global configuration: A tenant can override the `vaultNamespace` option and that will also set the `vaultAuthNamespace` option to the same value. 2. `vaultAuthNamespace` and `vaultNamespace` in the global configuration: When both options are set to different values in the global configuration, the tenant `vaultNamespace` option will not override the global `vaultAuthNamespace` option. The tenant can configure `vaultAuthNamespace` with a different value if required. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-03-02 08:36:33 +00:00
Madhu Rajanna	d5c98f81a2	rbd: make image features as optional parameter Makes the rbd images features in the storageclass as optional so that default image features of librbd can be used. and also kept the option to user to specify the image features in the storageclass. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-02-28 13:10:03 +00:00
Madhu Rajanna	fb3835691f	rbd: add support for deep-flatten image feature as deep-flatten is long supported in ceph and its enabled by default in the librbd, providing an option to enable it in cephcsi for the rbd images we are creating. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-02-28 13:10:03 +00:00
Madhu Rajanna	e9802c4940	cephfs: refactor cephfs core functions This commits refactors the cephfs core functions with interfaces. This helps in better code structuring and writing the unit test cases. update #852 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-02-22 20:39:23 +00:00
Madhu Rajanna	46378f3bfc	rbd: log stderror when running modprobe logging the error is not user-friendly and it contains system error message. Log the stderr which is user-friendly error message for identifying the problem. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-02-14 15:03:31 +00:00
Sébastien BERNARD	ee8fb3f05f	rbd: Fix dataPool in createVolumeResponse Return the dataPool used to create the image instead of the default one provided by the createVolumeRequest. In case of topologyConstrainedDataPools, they may differ. Don't add datapool if it's not present Signed-off-by: Sébastien Bernard <sebastien.bernard@sfr.com>	2022-02-10 11:44:22 +00:00
Humble Chirammal	8f6a7da538	cephfs: dont set explicit permissions on the volume At present we are node staging with worldwide permissions which is not correct. We should allow the CO to take care of it and make the decision. This commit also remove `fuseMountOptions` and `KernelMountOptions` as they are no longer needed Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-02-09 17:30:29 +00:00
Madhu Rajanna	2943555904	cephfs: fix omap deletion in DeleteSnapshot the omap is stored with the requested snapshot name not with the subvolume snapshotname. This fix uses the correct snapshot request name to cleanup the omap once the subvolume snapshot is deleted. fixes: #2832 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-02-08 20:37:53 +00:00
Humble Chirammal	ad6a3d7575	rbd: remove kp-metadata register functions of HPCS/Key Protect This commit removes `kp-metadata` registration from existing HPCS or Key Protect code as per the plan. Fix #2816 Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-02-08 18:27:03 +00:00
Humble Chirammal	1c3baa0722	rbd: add AAD(additionalAuthData) while unwrapping the DEK As we are using optional additional auth data while wrapping the DEK, we have to send the same additionally while unwrapping. Error: ``` failed to unwrap the DEK: kp.Error: ..(INVALID_FIELD_ERR)', reasons='[INVALID_FIELD_ERR: The field `ciphertext` must be: the original base64 encoded ciphertext from the wrap operation ``` Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-02-08 03:06:30 +00:00
Niels de Vos	f6894909d7	util: use `vaultNamespace` if `vaultAuthNamespace` is not set When a tenant configures `vaultNamespace` in their own ConfigMap, it is not applied to the Vault configuration, unless `vaultAuthNamespace` is set as well. This is unexpected, as the `vaultAuthNamespace` usually is something configured globally, and not per tenant. The `vaultAuthNamespace` is an advanced option, that is often not needed to be configured. Only when tenants have to configure their own `vaultNamespace`, it is possible that they need to use a different `vaultAuthNamespace`. The default for the `vaultAuthNamespace` is now the `vaultNamespace` value from the global configuration. Tenants can still set it to something else in their own ConfigMap if needed. Note that Hashicorp Vault Namespaces are only functional in the Enterprise version of the product. Therefor this can not be tested in the Ceph-CSI e2e with the Open Source version of Vault. Fixes: https://bugzilla.redhat.com/2050056 Reported-by: Rachael George <rgeorge@redhat.com> Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-02-07 08:20:48 +00:00
Rakshith R	3203673d17	cleanup: remove ceph.conf WA options which are already fixed This commit removes ceph.conf WA options: ``` # Workaround for http://tracker.ceph.com/issues/23446 fuse_set_user_groups = false # ceph-fuse which uses libfuse2 by default has write buffer size of 2KiB # adding 'fuse_big_writes = true' option by default to override this limit # see https://github.com/ceph/ceph-csi/issues/1928 fuse_big_writes = true ``` Since they are already fixed. Refer: https://tracker.ceph.com/issues/44885 Refer: https://tracker.ceph.com/issues/23446 Closes: #2825 Signed-off-by: Rakshith R <rar@redhat.com>	2022-02-04 15:42:32 +00:00
Madhu Rajanna	28fef9b379	cleanup: remove thick provisioning code This commit removes the thick provisioning code as thick provisioning is deprecated in cephcsi 3.5.0. fixes: #2795 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-28 11:17:15 +00:00
Humble Chirammal	4ee4fdfebd	rbd: unexport SecretsKMS from KMS implementation This commit unexport SecretsKMS from KMS implementation. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-01-28 06:55:12 +00:00
Humble Chirammal	4058246637	rbd: unexport vaultTokenSA struct from KMS implementation This commit unexport the vaultTokenSA from the vault KMS implementation Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-01-28 06:55:12 +00:00

... 2 3 4 5 6 ...

1016 Commits