ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2024-12-27 07:20:24 +00:00

Author	SHA1	Message	Date
Marcel Lauhoff	4e38bdac10	fscrypt: fsync encrypted dir after setting policy [workaround] Revert once our google/fscrypt dependency is upgraded to a version that includes https://github.com/google/fscrypt/pull/359 gets accepted Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	33c33a8b49	fscrypt: Use constant protector name Use constant protector name 'ceph-csi' instead of constant prefix concatenated with the volume ID. When cloning volumes the ID changes and fscrypt protected directories become inunlockable due to the protector name change Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	97cb1b6672	fscrypt: Update mount info before create context NewContextFrom{Mountpoint,Path} functions use cached `/proc/self/mountinfo` to find mounted file systems by device ID. Since we run fscrypt as a library in a long-lived process the cached information is likely to be stale. Stale entries may map device IDs to mount points of already destroyed RBDs and fail context creation. Updating the cache beforehand prevents this. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	a52314356e	fscrypt: Determine best supported fscrypt policy on node init Currently fscrypt supports policies version 1 and 2. 2 is the best choice and was the only choice prior to this commit. This adds support for kernels < 5.4, by selecting policy version 1 there. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	dd0e1988c0	fscrypt: Fetch passphrase when keyFn is invoked not created Fetch password when keyFn is invoked, not when it is created. This allows creation of the keyFn before actually creating the passphrase. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	a6a4282493	fscrypt: Unlock: Fetch keys early Fetch keys from KMS before doing anything else. This will catch KMS errors before setting up any fscrypt metadata. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	cfea8d7562	fscrypt: fscrypt integration Integrate google/fscrypt into Ceph CSI KMS and encryption setup. Adds dependencies to google/fscrypt and pkg/xattr. Be as generic as possible to support integration with both RBD and Ceph FS. Add the following public functions: InitializeNode: per-node initialization steps. Must be called before Unlock at least once. Unlock: All steps necessary to unlock an encrypted directory including setting it up initially. IsDirectoryUnlocked: Test if directory is really encrypted Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	624905d60d	kms: Add basic GetSecret() test Add rudimentary test to ensure that we can get a valid passphrase from the GetSecret() feature Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	0599089de0	util: Add util to fetch encryption type from vol options Fetch encryption type from vol options. Make fallback type configurable to support RBD (default block) and Ceph FS (default file) Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	fe4821435e	util: Make encryption passphrase size a parameter fscrypt support requires keys longer than 20 bytes. As a preparation, make the new passphrase length configurable, but default to 20 bytes. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Rakshith R	d39d2cffcc	cleanup: use index instead of value while iterating This commit cleans up for loop to use index to access value instead of copying value into a new variable while iterating. ``` internal/util/csiconfig.go:103:2: rangeValCopy: each \ iteration copies 136 bytes (consider pointers or indexing) \ (gocritic) for _, cluster := range config { ``` Signed-off-by: Rakshith R <rar@redhat.com>	2022-08-09 13:36:03 +00:00
Rakshith R	3d3c029471	nfs: add nodeserver within cephcsi This commit adds nfs nodeserver capable of mounting nfs volumes, even with pod networking using NSenter design similar to rbd and cephfs. NodePublish, NodeUnpublish, NodeGetVolumeStats and NodeGetCapabilities have been implemented. The nodeserver implementation has been inspired from https://github.com/kubernetes-csi/csi-driver-nfs, which was previously used for mounted cephcsi exported nfs volumes. The current implementation is also backward compatible for the previously created PVCs. Signed-off-by: Rakshith R <rar@redhat.com>	2022-08-09 13:36:03 +00:00
Niels de Vos	011d4fc81c	cleanup: create k8s.io/mount-utils Mounter only once Recently the k8s.io/mount-utils package added more runtime dectection. When creating a new Mounter, the detect is run every time. This is unfortunate, as it logs a message like the following: ``` mount_linux.go:283] Detected umount with safe 'not mounted' behavior ``` This message might be useful, so it probably good to keep it. In Ceph-CSI there are various locations where Mounter instances are created. Moving that to the DefaultNodeServer type reduces it to a single place. Some utility functions need to accept the additional parameter too, so that has been modified as well. See-also: kubernetes/kubernetes#109676 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-07-21 07:14:43 +00:00
takeaki-matsumoto	1025871021	cephfs: Support mount option on nodeplugin add mount options on nodeplugin side Signed-off-by: takeaki-matsumoto <takeaki.matsumoto@linecorp.com>	2022-07-18 22:04:12 +00:00
Madhu Rajanna	f171143135	cephfs: round to cephfs size to multiple of 4Mib Due to the bug in the df stat we need to round off the subvolume size to align with 4Mib. Note:- Minimum supported size in cephcsi is 1Mib, we dont need to take care of Kib. fixes #3240 More details at https://github.com/ceph/ceph/pull/46905 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-07-13 18:32:40 +00:00
Niels de Vos	14ba1498bf	util: reduce systemd related errors while mounting There are regular reports that identify a non-error as the cause of failures. The Kubernetes mount-utils package has detection for systemd based environments, and if systemd is unavailable, the following error is logged: Cannot run systemd-run, assuming non-systemd OS systemd-run output: System has not been booted with systemd as init system (PID 1). Can't operate. Failed to create bus connection: Host is down, failed with: exit status 1 Because of the `failed` and `exit status 1` error message, users might assume that the mounting failed. This does not need to be the case. The container-images that the Ceph-CSI projects provides, do not use systemd, so the error will get logged with each mount attempt. By using the newer MountSensitiveWithoutSystemd() function from the mount-utils package where we can, the number of confusing logs get reduced. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-07-04 10:02:54 +00:00
Prasanna Kumar Kalever	caf4090657	rbd: provide option to disable setting metadata on rbd images As we added support to set the metadata on the rbd images created for the PVC and volume snapshot, by default metadata is set on all the images. As we have seen we are hitting issues#2327 a lot of times with this, we start to leave a lot of stale images. Currently, we rely on `--extra-create-metadata=true` to decide to set the metadata or not, we cannot set this option to false to disable setting metadata because we use this for encryption too. This changes is to provide an option to disable setting the image metadata when starting cephcsi. Fixes: #3009 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-06-28 19:12:53 +00:00
Madhu Rajanna	7a2dd4c3cf	rbd: create token and use it for vault SA create the token if kubernetes version in 1.24+ and use it for vault sa. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Rakshith R <rar@redhat.com>	2022-06-17 11:37:59 +00:00
Prasanna Kumar Kalever	2880c25fd6	rbd: set cluster Name as metadata on the image This change helps read the cluster name from the cmdline args, the provisioner will set the same on the RBD images. Fixes: #2973 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-06-08 16:23:59 +00:00
Prasanna Kumar Kalever	deb003e605	cleanup: use prefix instead of hardcoding csiParameterPrefix Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-06-08 16:23:59 +00:00
Madhu Rajanna	1952a9b4b3	ci: fix all linter errors found in golangci-lint Fixing all the linter errors found in golang-ci lint v1.46.2 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-06-03 12:55:54 +00:00
Prasanna Kumar Kalever	bac33262ae	rbd: add unset volume/snapshot metadata utility functions Added GetVolumeMetadataKeys() GetSnaoshotMetadataKeys() unsetVolumeMetadata() and unsetSnapshotMetadata() functions. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-05-12 15:54:09 +00:00
Marcus Röder	a95a6213eb	util: support systems using the new cgroup v2 structure With cgroup v2, the location of the pids.max file changed and so did the /proc/self/cgroup file new /proc/self/cgroup file ` 0::/user.slice/user-500.slice/session-14.scope ` old file: ` 11:pids:/user.slice/user-500.slice/session-2.scope 10:blkio:/user.slice 9:net_cls,net_prio:/ 8:perf_event:/ ... ` There is no directory per subsystem (e.g. /sys/fs/cgroup/pids) any more, all files are now in one directory. fixes: https://github.com/ceph/ceph-csi/issues/3085 Signed-off-by: Marcus Röder <m.roeder@yieldlab.de>	2022-05-07 20:38:48 +00:00
Madhu Rajanna	d2bc9743f7	cephfs: add netNamespaceFilePath for CephFS as same host directory is not shared between the cephfs and the rbd plugin pod. we need to keep the netNamespaceFilePath separately for both cephfs and rbd. CephFS plugin will use this path to execute mount -t commands. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-19 12:28:46 +00:00
Madhu Rajanna	eb4bfb7326	cleanup: use block comment for ClusterInfo example Adjusted the mix of tabs and the spaces and also used block comment for better readability. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-19 12:28:46 +00:00
Madhu Rajanna	b4acbd08a5	rbd: move radosNamespace to RBD section As radosNamespace is more specific to RBD not the general ceph configuration. Now we introduced a new RBD section for RBD specific options, Moving the radosNamespace to RBD section and keeping the radosNamespace still under the global ceph level configration for backward compatibility. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-19 12:28:46 +00:00
Madhu Rajanna	766346868e	util: Add RBD specific options in clusterInfo As the netNamespaceFilePath can be separate for both cephfs and rbd adding the netNamespaceFilePath path for RBD, This will help us to keep RBD and CephFS specific options separately. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-19 12:28:46 +00:00
Madhu Rajanna	c245436ec4	util: fix logging in ExecuteCommandWithNSEnter log the nsenter and its argument after executing the command with the nsenter CLI. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-14 12:17:21 +00:00
Niels de Vos	28369702d2	nfs: use go-ceph API for creating/deleting exports Recent versions of Ceph allow calling the NFS-export management functions over the go-ceph API. This seems incompatible with older versions that have been tested with the `ceph nfs` commands that this commit replaces. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-04-14 08:01:45 +00:00
Humble Chirammal	959df4dbac	doc: correct typos in struct field comments and release.md corrected strings in the release guide and util server. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-04-11 06:23:25 +00:00
Prasanna Kumar Kalever	41fe2c7dda	rbd: set metadata on the snapshot Set snapshot-name/snapshot-namespace/snapshotcontent-name details on RBD backend snapshot image as metadata on snapshot Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Prasanna Kumar Kalever	ae5925f04c	rbd: update PV/PVC metadata on a reattach of PV Example if a PVC was delete by setting `persistentVolumeReclaimPolicy` as `Retain` on PV, and PV is reattached to a new PVC, we make sure to update PV/PVC image metadata on a PV reattach. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Prasanna Kumar Kalever	4d750ed0e5	rbd: add set/Get VolumeMetadata() utility function Define and use PV and PVC metadata keys used by external provisioner. The CSI external-provisioner (v1.6.0+) introduces the --extra-create-metadata flag, which automatically sets map<string, string> parameters in the CSI CreateVolumeRequest. Add utility functions to set/Get PV/PVC/PVCNamespace metadata on image Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-04-08 15:43:14 +00:00
Madhu Rajanna	7b2aef0d81	util: add support for the nsenter add support to run rbd map and mount -t commands with the nsenter. complete design of pod/multus network is added here https://github.com/rook/rook/ blob/master/design/ceph/multus-network.md#csi-pods Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-04-08 10:23:21 +00:00
Thibaut Blanchard	e874c9c11b	rbd: fix topology snapshot pool Restoring a snapshot with a new PVC results with a wrong dataPoolName in case of initial volume linked to a storageClass with topology constraints and erasure coding. Signed-off-by: Thibaut Blanchard <thibaut.blanchard@gmail.com>	2022-03-30 04:40:30 +00:00
Robert Vasek	f6ae612003	util: added reference tracker RT, reference tracker, is key-based implementation of a reference counter. Unlike an integer-based counter, RT counts references by tracking unique keys. This allows accounting in situations where idempotency must be preserved. It guarantees there will be no duplicit increments or decrements of the counter. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-27 19:24:26 +00:00
Madhu Rajanna	366c2ace31	util: add helper to get pvcnamespace from input added helper function to return the pvc namespace name from the input parameters. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Madhu Rajanna	772fe8d6c8	util: add helper function to strip kube parameters added helper function to strip the kubernetes specific parameters from the volumeContext as volumeContext is storaged in the PV volumeAttributes Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-03-21 08:54:43 +00:00
Robert Vasek	80dda7cc30	cephfs: detect corrupt ceph-fuse mounts and try to remount Mounts managed by ceph-fuse may get corrupted by e.g. the ceph-fuse process exiting abruptly, or its parent container being terminated, taking down its child processes with it. This commit adds checks to NodeStageVolume and NodePublishVolume procedures to detect whether a mountpoint in staging_target_path and/or target_path is corrupted, and remount is performed if corruption is detected. Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-03-10 06:05:52 +00:00
Rakshith R	3203673d17	cleanup: remove ceph.conf WA options which are already fixed This commit removes ceph.conf WA options: ``` # Workaround for http://tracker.ceph.com/issues/23446 fuse_set_user_groups = false # ceph-fuse which uses libfuse2 by default has write buffer size of 2KiB # adding 'fuse_big_writes = true' option by default to override this limit # see https://github.com/ceph/ceph-csi/issues/1928 fuse_big_writes = true ``` Since they are already fixed. Refer: https://tracker.ceph.com/issues/44885 Refer: https://tracker.ceph.com/issues/23446 Closes: #2825 Signed-off-by: Rakshith R <rar@redhat.com>	2022-02-04 15:42:32 +00:00
Madhu Rajanna	28fef9b379	cleanup: remove thick provisioning code This commit removes the thick provisioning code as thick provisioning is deprecated in cephcsi 3.5.0. fixes: #2795 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-28 11:17:15 +00:00
Madhu Rajanna	9c841c83d4	cleanup: rename errorPair to pairError to fix the errname check renaming the struct. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-24 05:25:11 +00:00
Madhu Rajanna	c67bacdb11	cleanup: use %s instead of %w for t.Errorf As t.Errorf does not support error-wrapping directive using %s. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-24 05:25:11 +00:00
Madhu Rajanna	813f6c30cc	cleanup: use WriteString instead of Write use WriteString instead of Write for the temp files. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-24 05:25:11 +00:00
Madhu Rajanna	aba6979d29	cleanup: use os.ReadFile to read file as ioutil.ReadFile is deprecated and suggestion is to use os.ReadFile as per https://pkg.go.dev/io/ioutil updating the same. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-24 05:25:11 +00:00
Madhu Rajanna	562dff0d19	cleanup: use os.WriteFile to write files as ioutil.WriteFile is deprecated and suggestion is to use os.WriteFile as per https://pkg.go.dev/io/ioutil updating the same. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-01-24 05:25:11 +00:00
Rakshith R	c19264e996	rbd: add function (cc *ClusterConnection) GetTaskAdmin() This function returns new go-ceph TaskAdmin to add tasks on rbd volumes. Signed-off-by: Rakshith R <rar@redhat.com>	2022-01-06 12:28:18 +00:00
Madhu Rajanna	95e9595c1f	util: add helper ExecCommandWithTimeout function added ExecCommandWithTimeout helper function to execute the commands with the timeout option, if the command does not return any response with in the timeout time the process will be terminated and error will be returned back to the user. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-12-23 13:36:21 +00:00
Humble Chirammal	88911eb4e9	rbd: add migration secret support to controllerserver functions This commit adds the migration secret request validation to expand, create controller functions. Ref # https://github.com/ceph/ceph-csi/issues/2509 Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-12-20 07:34:43 +00:00
Niels de Vos	e76bffe353	cleanup: import k8s.io/mount-utils instead of k8s.io/utils/mount k8s.io/utils/mount has moved to k8s.io/mount-utils, and Ceph-CSI uses that already in most locations. Only internal/util/util.go still imports the old path. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-12-09 17:58:34 +00:00
Niels de Vos	b3910f2b4a	rbd: enable CSI-Addons Server and Identity Service Add a new endpoint for the CSI-Addons Service and enable the Identity Service for the RBD plugin. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-11-30 11:48:40 +00:00
Madhu Rajanna	d05fc1e8e5	util: add helper to get the cluster ID added helper function to get the cluster ID. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-11-19 09:38:59 +05:30
Madhu Rajanna	0f0cda49a7	rbd: log stdError for cryptosetup command If we hit any error while running the cryptosetup commands we are logging only the error message. with only error message it is difficult to analyze the problem, logging the stdError will help us to check what is the problem. updates: #2610 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-11-18 02:17:15 +00:00
Prasanna Kumar Kalever	50e9dfa5c5	cleanup: fix log level This log line is seen frequently in the logs and its better to be at Warning loglevel rather than Error based on its severity E1109 08:30:45.612395 38328 util.go:247] kernel 4.19.202 does not support required features Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-11-10 10:54:29 +00:00
Humble Chirammal	ff0911fb6a	rbd: add unittests for IsMigrationSecret and ParseAndSetSecretMapFromMigSecret This commit adds unit tests for newly introduced migration specific functions. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-10-27 18:35:00 +00:00
Humble Chirammal	b49bf4b987	rbd: parse migration secret and set it for controller server operations This commit adds a couple of helper functions to parse the migration request secret and set it for further csi driver operations. More details: The intree secret has a data field called "key" which is the base64 admin secret key. The ceph CSI driver currently expect the secret to contain data field "UserKey" for the equivalant. The CSI driver also expect the "UserID" field which is not available in the in-tree secret by deafult. This missing userID will be filled (if the username differ than 'admin') in the migration secret as 'adminId' field in the migration request, this commit adds the logic to parse this migration secret as below: "key" field value will be picked up from the migraion secret to "UserKey" field. "adminId" field value will be picked up from the migration secret to "UserID" field if `adminId` field is nil or not set, `UserID` field will be filled with default value ie `admin`.The above logic get activated only when the secret is a migration secret, otherwise skipped to the normal workflow as we have today. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-10-27 18:35:00 +00:00
Humble Chirammal	c584fa20da	rbd: use clusterID from volumeContext at nodestage previously we were retriving clusterID using the monitors field in the volume context at node stage code path. however it is possible to retrieve or use clusterID directly from the volume context. This commit also remove the getClusterIDFromMigrationVolume() function which was used previously and its tests Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-10-11 10:06:30 +00:00
Madhu Rajanna	b1ef842640	cleanup: move core functions to core pkg as we are refractoring the cephfs code, Moving all the core functions to a new folder /pkg called core. This will make things easier to implement. For now onwards all the core functionalities will be added to the core package. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-09-23 06:39:37 +00:00
Humble Chirammal	4804f47b18	e2e: Add e2e for rbd migration static pvc This commit adds e2e for rbd migration static PVCs Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-09-20 09:54:54 +00:00
Humble Chirammal	2e8e8f5e64	rbd: fill clusterID if its a migration nodestage request the migration nodestage request does not carry the 'clusterID' in it and only monitors are available with the volumeContext. The volume context flag 'migration=true' and 'static=true' flags allow us to fill 'clusterID' from the passed in monitors to the volume Context,so that rest of the static operations on nodestage can be proceeded as we do treat static volumes today. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-09-20 09:54:54 +00:00
Humble Chirammal	1f5963919f	util: get clusterID for the passed in mon string as part of migration support, the clusterID has to be fetched from passed in mon. Because the intree RBD storage class only got monitor and not `clusterID` parameter support. However, in CSI, SC has the `clusterID` parameter support but not mon. Due to that we have to fetch the clusterID from config file for the passed in mon and use it in our operations. This adds a helper function to retrieve clusterID from passed in mon string. Updates https://github.com/ceph/ceph-csi/issues/2509 Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-09-20 09:54:54 +00:00
Prasanna Kumar Kalever	ad2c6d2851	util: add gzip helper function Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-09-16 13:55:15 +00:00
Rakshith R	82d09d81cf	util: modify GetMonsAndClusterID() to take clusterID instead of options This commit: - modifies GetMonsAndClusterID() to take clusterID instead of options. - moves out validation of clusterID is set or not out of GetMonsAndClusterID(). - defines ErrClusterIDNotSet new error for reusability. - add GetClusterID() to obtain clusterID from options. Signed-off-by: Rakshith R <rar@redhat.com>	2021-09-14 08:39:57 +00:00
Rakshith R	9d1e98ca60	rbd: check for clusterid mapping in genVolFromVolumeOptions() This commit adds capability to genVolFromVolumeOptions() to fetch mapped clusted-id & mon ips for mirrored PVC on secondary cluster which may have different cluster-id. This is required for NodeStageVolume(). We also don't need to check for mapping during volume create requests, so it can be disabled by passing a bool checkClusterIDMapping as false. GetMonsAndClusterID() is modified to accept bool checkClusterIDMapping based on which clustermapping is checked to fetch mapped cluster-id and mon-ips. Signed-off-by: Rakshith R <rar@redhat.com>	2021-09-14 08:39:57 +00:00
Rakshith R	e99dd3dea4	util: read ceph.conf by calling conn.ReadConfigFile(CephConfigPath) The configurations in cpeh.conf is not picked up by rados connection automatically, hence we need to call conn.ReadConfigFile before calling Connect(). Signed-off-by: Rakshith R <rar@redhat.com>	2021-09-07 16:50:12 +00:00
Madhu Rajanna	4865061ab9	util: create ceph configuration files if not present create ceph.conf and keyring files if its not present in the /et/ceph/ path. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-09-03 14:14:43 +00:00
Niels de Vos	60c2afbcca	util: NewK8sClient() should not panic on non-Kubernetes clusters When NewK8sClient() detects and error, it used to call FatalLogMsg() which causes a panic. There are additional features that can be used on Kubernetes clusters, but these are not a requirement for most functionalities of the driver. Instead of causing a panic, returning an error should suffice. This allows using the driver on non-Kubernetes clusters again. Fixes: #2452 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-09-02 11:22:14 +00:00
Rakshith R	99168dc822	rbd: check for clusterid mapping in RegenerateJournal() This commit adds fetchMappedClusterIDAndMons() which returns monitors and clusterID info after checking cluster mapping info. This is required for regenerating omap entries in mirrored cluster with different clusterID. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-31 14:30:06 +00:00
Rakshith R	496bcba85c	rbd: move GetMappedID() to util package This commit moves getMappedID() from rbd to util package since it is not rbd specific and exports it from there. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-31 14:30:06 +00:00
Niels de Vos	4a3b1181ce	cleanup: move KMS functionality into its own package A new "internal/kms" package is introduced, it holds the API that can be consumed by the RBD components. The KMS providers are currently in the same package as the API. With later follow-up changes the providers will be placed in their own sub-package. Because of the name of the package "kms", the types, functions and structs inside the package should not be prefixed with KMS anymore: internal/kms/kms.go:213:6: type name will be used as kms.KMSInitializerArgs by other packages, and that stutters; consider calling this InitializerArgs (golint) Updates: #852 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-30 16:31:40 +00:00
Niels de Vos	778b5e86de	cleanup: move k8s functions to the util/k8s package By placing the NewK8sClient() function in its own package, the KMS API can be split from the "internal/util" package. Some of the KMS providers use the NewK8sClient() function, and this causes circular dependencies between "internal/utils" -> "internal/kms" -> "internal/utils", which are not alowed in Go. Updates: #852 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-30 16:31:40 +00:00
Niels de Vos	6d00b39886	cleanup: move log functions to new internal/util/log package Moving the log functions into its own internal/util/log package makes it possible to split out the humongous internal/util packages in further smaller pieces. This reduces the inter-dependencies between utility functions and components, preventing circular dependencies which are not allowed in Go. Updates: #852 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-26 09:34:05 +00:00
Niels de Vos	68588dc7df	util: fix unit-test for GetClusterMappingInfo() Unit-testing often fails due to a race condition while writing the clusterMappingConfigFile from multiple go-routines at the same time. Failures from `make containerized-test` look like this: === CONT TestGetClusterMappingInfo/site2-storage_cluster-id_mapping cluster_mapping_test.go:153: GetClusterMappingInfo() = <nil>, expected data &[{map[site1-storage:site2-storage] [map[1:3]] [map[11:5]]} {map[site3-storage:site2-storage] [map[8:3]] [map[10:5]]}] === CONT TestGetClusterMappingInfo/site3-storage_cluster-id_mapping cluster_mapping_test.go:153: GetClusterMappingInfo() = <nil>, expected data &[{map[site3-storage:site2-storage] [map[8:3]] [map[10:5]]}] --- FAIL: TestGetClusterMappingInfo (0.01s) --- PASS: TestGetClusterMappingInfo/mapping_file_not_found (0.00s) --- PASS: TestGetClusterMappingInfo/mapping_file_found_with_empty_data (0.00s) --- PASS: TestGetClusterMappingInfo/cluster-id_mapping_not_found (0.00s) --- FAIL: TestGetClusterMappingInfo/site2-storage_cluster-id_mapping (0.00s) --- FAIL: TestGetClusterMappingInfo/site3-storage_cluster-id_mapping (0.00s) --- PASS: TestGetClusterMappingInfo/site1-storage_cluster-id_mapping (0.00s) By splitting the public GetClusterMappingInfo() function into an internal getClusterMappingInfo() that takes a filename, unit-testing can use different files for each go-routine, and testing becomes more predictable. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-25 16:08:48 +00:00
Rakshith R	f05ac2b25d	rbd: extract kmsID from volumeAttributes in RegenerateJournal() This commit adds functionality of extracting encryption kmsID, owner from volumeAttributes in RegenerateJournal() and adds utility functions ParseEncryptionOpts and FetchEncryptionKMSID. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-10 09:17:59 +00:00
Madhu Rajanna	6cc37f0a17	cleanup: use different file name for testing For clusterMappingConfigFile using different file name so that multiple unit test cases can work without any data race. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-09 13:37:25 +00:00
Madhu Rajanna	916c97b4a8	rbd: copy creds when copying the connection rbd flatten functions is a CLI call and it expects the creds as the input and copying of creds is required when we generate the temp clone image. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-09 07:28:43 +00:00
Niels de Vos	b5d2321d57	cleanup: use vaultDefaultCAVerify to set default value Golang-ci complains about the following: internal/util/vault_tokens.go:99:20: string `true` has 4 occurrences, but such constant `vaultDefaultDestroyKeys` already exists (goconst) v.VaultCAVerify = "true" ^ This occurence of "true" can be replaced by vaultDefaultCAVerify so address the warning. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-06 12:19:18 +00:00
Niels de Vos	f584db41e6	util: add vaultDestroyKeys option to destroy Vault kv-v2 secrets Hashicorp Vault does not completely remove the secrets in a kv-v2 backend when the keys are deleted. The metadata of the keys will be kept, and it is possible to recover the contents of the keys afterwards. With the new `vaultDestroyKeys` configuration parameter, this behaviour can now be selected. By default the parameter will be set to `true`, indicating that the keys and contents should completely be destroyed. Setting it to any other value will make it possible to recover the deleted keys. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-06 12:19:18 +00:00
Madhu Rajanna	ac11d71e19	util: add helper function to read clusterID mapping added helper function to read the clusterID mapping from the mounted file. The clusterID mapping contains below mappings * ClusterID mappings (to cluster to which we are failingover and from which cluster failover happened) * RBD PoolID mapping of between the clusters. * CephFS FscID mapping between the clusters. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-05 16:07:51 +00:00
Niels de Vos	4859f2dfdb	util: allow configuring VAULT_AUTH_MOUNT_PATH for Vault Tenant SA KMS The VAULT_AUTH_MOUNT_PATH is a Vault configuration parameter that allows a user to set a non default path for the Kubernetes ServiceAccount integration. This can already be configured for the Vault KMS, and is now added to the Vault Tenant SA KMS as well. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-05 06:02:57 +00:00
Niels de Vos	f2d5c2e0df	util: add vaultAuthNamespace option for Vault KMS The new `vaultAuthNamespace` configuration parameter can be set to the Vault Namespace where the authentication is setup in the service. Some Hashicorp Vault deployments use sub-namespaces for their users/tenants, with a 'root' namespace where the authentication is configured. This requires passing of different Vault namespaces for different operations. Example: - the Kubernetes Auth mechanism is configured for in the Vault Namespace called 'devops' - a user/tenant has a sub-namespace called 'devops/website' where the encryption passphrases can be placed in the key-value store The configuration for this, then looks like: vaultAuthNamespace: devops vaultNamespace: devops/homepage Note that Vault Namespaces are a feature of the Hashicorp Vault Enterprise product, and not part of the Open Source version. This prevents adding e2e tests that validate the Vault Namespace configuration. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-04 18:20:45 +00:00
Niels de Vos	83167e2ac5	util: correct error message when connecting to Vault fails Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-04 18:20:45 +00:00
Artur Troian	16ec97d8f7	util: getCgroupPidsFile produces striped path when extra : present This commit uses `string.SplitN` instead of `string.Split`. The path for pids.max has extra `:` symbols in it due to which getCgroupPidsFile() splits the string into 5 tokens instead of 3 leading to loss of part of the path. As a result, the below error is reported: `Failed to get the PID limit, can not reconfigure: open /sys/fs/cgroup/pids/system.slice/containerd.service/ kubepods-besteffort-pod183b9d14_aed1_4b66_a696_da0c738bc012.slice/pids.max: no such file or directory` SplitN takes an argument n and splits the string accordingly which helps us to get the desired file path. Fixes: #2337 Co-authored-by: Yati Padia <ypadia@redhat.com> Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-08-03 06:03:10 +00:00
Prasanna Kumar Kalever	526ff95f10	rbd: add support to expand encrypted volume Previously in ControllerExpandVolume() we had a check for encrypted volumes and we use to fail for all expand requests on an encrypted volume. Also for Block VolumeMode PVCs NodeExpandVolume used to be ignored/skipped. With these changes, we add support for the expansion of encrypted volumes. Also for raw Block VolumeMode PVCs with Encryption we call NodeExpandVolume. That said, With LUKS1, cryptsetup utility doesn't prompt for a passphrase on resizing the crypto mapper device. This is because LUKS1 devices don't use kernel keyring for volume keys. Whereas, LUKS2 devices use kernel keyring for volume key by default, i.e. cryptsetup utility asks for a passphrase if it detects volume key was previously passed to dm-crypt via kernel keyring service, we are overriding the default by --disable-keyring option during cryptsetup open command. So that at the time of crypto mapper device resize we will not be prompted for any passphrase. Fixes: #1469 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-23 10:00:23 +00:00
Prasanna Kumar Kalever	4fa05cb3a1	util: add helper functions for resize of encrypted volume such as: ResizeEncryptedVolume() and LuksResize() Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-23 10:00:23 +00:00
Prasanna Kumar Kalever	572f39d656	util: fix log level in OpenEncryptedVolume() Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-23 10:00:23 +00:00
Prasanna Kumar Kalever	812003eb45	util: fix bug in DeviceEncryptionStatus() With Luks1 device: $ cryptsetup status /dev/mapper/crypto-rbd0 /dev/mapper/crypto-rbd0 is active and is in use. type: LUKS1 cipher: aes-xts-plain64 keysize: 512 bits key location: dm-crypt device: /dev/rbd0 sector size: 512 offset: 4096 sectors size: 4190208 sectors mode: read/write With Luks2 device: $ cryptsetup status /dev/mapper/crypto-rbd0 /dev/mapper/crypto-rbd0 is active and is in use. type: LUKS2 cipher: aes-xts-plain64 keysize: 512 bits key location: dm-crypt device: /dev/rbd0 sector size: 512 offset: 32768 sectors size: 4161536 sectors mode: read/write This could lead to failures with unmap in the NodeUnstageVolume path for the encrypted volumes. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-23 10:00:23 +00:00
Yati Padia	1ae2afe208	cleanup: modifies the error caused due to merged PRs This commit modifies the error of godot, cyclop, paralleltest linter caused due to merged PRs. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-22 18:15:48 +00:00
Yati Padia	172b66f73f	cleanup: resolves cyclop linter issue this commit adds `// nolint:cyclop` for the fucntions whose complexity is above 20 Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-22 18:15:48 +00:00
Humble Chirammal	abe6a6e5ac	util: remove deleteLock test as it is enforced by the controller Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-22 15:07:49 +00:00
Humble Chirammal	c42d4768ca	util: remove the deleteLock acquistion check for clone and snapshot At present while acquiring the deleteLock on the volume, we check for ongoing clone and snapshot creation operations on the same. Considering snapshot and clone controllers does not allow parent volume deletion on subjected operations, we can be free from this extra check. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-22 15:07:49 +00:00
Niels de Vos	82557e3f34	util: allow configuring VAULT_BACKEND for Vault connection It seems that the version of the key/value engine can not always be detected for Hashicorp Vault. In certain cases, it is required to configure the `VAULT_BACKEND` (or `vaultBackend`) option so that a successful connection to the service can be made. The `kv-v2` is the current default for development deployments of Hashicorp Vault (what we use for automated testing). Production deployments default to version 1 for now. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-07-22 13:02:47 +00:00
Rakshith R	43f753760b	cleanup: resolve nlreturn linter issues nlreturn linter requires a new line before return and branch statements except when the return is alone inside a statement group (such as an if statement) to increase code clarity. This commit addresses such issues. Updates: #1586 Signed-off-by: Rakshith R <rar@redhat.com>	2021-07-22 06:05:01 +00:00
Yati Padia	3469dfc753	cleanup: resolve errorlint issues This commit resolves errorlint issues which checks for the code that will cause problems with the error wrapping scheme. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-19 13:31:29 +00:00
Humble Chirammal	bd947bbe31	util: remove deleteLock check while acquiring snapshot createLock snapshot controller make sure the pvc which is the source for the snapshot request wont get deleted while snapshot is getting created, so we dont need to check for any ongoing delete operation here on the volume. Subjected code path in snapshot controller: ``` pvc, err := ctrl.getClaimFromVolumeSnapshot(snapshot) . .. pvcClone.ObjectMeta.Finalizers = append(pvcClone.ObjectMeta.Finalizers, utils.PVCFinalizer) _, err = ctrl.client.CoreV1().PersistentVolumeClaims(pvcClone.Namespace).Update(..) ``` Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-17 10:23:13 +00:00
Prasanna Kumar Kalever	b6a88dd728	rbd: add volume healer Problem: ------- For rbd nbd userspace mounter backends, after a restart of the nodeplugin all the mounts will start seeing IO errors. This is because, for rbd-nbd backends there will be a userspace mount daemon running per volume, post restart of the nodeplugin pod, there is no way to restore the daemons back to life. Solution: -------- The volume healer is a one-time activity that is triggered at the startup time of the rbd nodeplugin. It navigates through the list of volume attachments on the node and acts accordingly. For now, it is limited to nbd type storage only, but it is flexible and can be extended in the future for other backend types as needed. From a few feets above: This solves a severe problem for nbd backed csi volumes. The healer while going through the list of volume attachments on the node, if finds the volume is in attached state and is of type nbd, then it will attempt to fix the rbd-nbd volumes by sending a NodeStageVolume request with the required volume attributes like secrets, device name, image attributes, and etc.. which will finally help start the required rbd-nbd daemons in the nodeplugin csi-rbdplugin container. This will allow reattaching the backend images with the right nbd device, thus allowing the applications to perform IO without any interruptions even after a nodeplugin restart. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Humble Chirammal	94c5c5e119	util: remove deleteLock while we acquire clone operation lock clone controller make sure there is no delete operation happens on the source PVC which has been referred as the datasource of clone PVC, we are safe to operate without looking at delete operation lock in this case. Subjected code in the controller: ... if claim.Spec.DataSource != nil && rc.clone { err = p.setCloneFinalizer(ctx, claim) ... } if !checkFinalizer(claim, pvcCloneFinalizer) { claim.Finalizers = append(claim.Finalizers, pvcCloneFinalizer) _, err := p.client.CoreV1().PersistentVolumeClaims(claim.Namespace).Update(..claim..) } Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-16 12:32:28 +00:00
Yati Padia	f36d611ef9	cleanup: resolves gofumpt issues of internal codes This PR runs gofumpt for internal folder. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-14 19:50:56 +00:00
Niels de Vos	d941e5abac	util: make parseTenantConfig() usable for modular KMSs parseTenantConfig() only allowed configuring a defined set of options, and KMSs were not able to re-use the implementation. Now, the function parses the ConfigMap from the Tenants Namespace and returns a map with options that the KMS supports. The map that parseTenantConfig() returns can be inspected by the KMS, and applied to the vaultTenantConnection type by calling parseConfig(). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-07-13 17:16:35 +00:00
Niels de Vos	3d7d48a4aa	util: VaultTenantSA KMS implementation This new KMS uses a Kubernetes ServiceAccount from a Tenant (Namespace) to connect to Hashicorp Vault. The provisioner and node-plugin will check for the configured ServiceAccount and use the token that is located in one of the linked Secrets. Subsequently the Vault connection is configured to use the Kubernetes token from the Tenant. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-07-13 17:16:35 +00:00

1 2 3 4 5 ...

308 Commits