ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-04-11 18:13:00 +00:00

Author	SHA1	Message	Date
Rakshith R	1bb78fdf43	e2e: validate PVC-PVC clone creation with deleted parent snap This commit modifies a test case to check creation of PVC-PVC clone of a restored PVC when parent snapshot is deleted. Signed-off-by: Rakshith R <rar@redhat.com>	2024-04-23 12:04:59 +00:00
Rakshith R	c34b31ee05	rbd: add ParentInTrash parameter in rbdImage struct This commit adds ParentInTrash parameter in rbdImage struct and makes use of it in getParent() function in order to avoid error in case the parent is present but in trash. Signed-off-by: Rakshith R <rar@redhat.com>	2024-04-23 12:04:59 +00:00
Madhu Rajanna	4c2d2caf9f	util: add support to configure mirror daemon count Currently we assume that only one rbd mirror daemon is running on the ceph cluster, but that is not true in many cases and there can be more than one. This PR makes the count a configurable parameter. fixes: #4312 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-04-22 09:49:59 +00:00
Madhu Rajanna	8c4a38eec6	rbd: address golangci-lint issues addressing golangci-lint issues in rbd related code. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-04-09 06:55:23 +00:00
Praveen M	c1467242c6	cleanup: use slices package This commit replaces the user implemented function `CheckSliceContains()` with `slices.Contains()` function introduced in Go 1.21. Signed-off-by: Praveen M <m.praveen@ibm.com>	2024-04-05 12:18:00 +00:00
Praveen M	3538b23794	rbd: remove topologyConstrainedPools parameter This commit removes the `topologyConstrainedPools` parameter from PV volumeAttributes as it is not required. Signed-off-by: Praveen M <m.praveen@ibm.com>	2024-04-05 12:18:00 +00:00
Niels de Vos	5a6556c4d4	cleanup: destroy connections after .Copy() another one Every time a connection is copied with the .Copy() function, it needs to be destroyed once the object is not needed anymore. This was not done consistently; a few more locations require the freeing of the connection resources. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-04-02 11:11:20 +00:00
Niels de Vos	3df396e6f1	rbd: add extra logging while cleaning up snapshots Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-03-28 11:54:28 +00:00
Niels de Vos	ba05c0f5f1	cleanup: reformat generateVolFromSnap() to rbdSnapshot.toVolume() Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-03-28 11:54:28 +00:00
Niels de Vos	a517290ea7	rbd: let parseVolCreateRequest() return a connected rbdVolume By returning a connected rbdVolume in parseVolCreateRequest(), the CreateVolume() function can be simplified a little. There is no need to call the additional Connect() and detect failures with it. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-03-28 11:54:28 +00:00
Niels de Vos	7b2b125b18	rbd: free snapshot resources after allocation Not all snapshot objects are free'd correctly after they were allocated. It is possible that some connections to the Ceph cluster were never closed. This does not need to be a noticeable problem, as connections are re-used where possible, but it isn't clean either. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-03-28 11:54:28 +00:00
Niels de Vos	18162c71bc	cleanup: do not pass an empty snapshot to genSnapFromSnapID() Just like GenVolFromVolID() the genSnapFromSnapID() function can return a snapshot. There is no need to allocate an empty snapshot and pass that to the genSnapFromSnapID() function. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-03-28 11:54:28 +00:00
parth-gr	063319f6e5	rbd: make pool optional in rbd sc if topologyconstraints are present if rbd storage class is created with topologyconstraintspools replicated pool was still mandatory, making the pool optional if the topologyconstraintspools is requested Closes: https://github.com/ceph/ceph-csi/issues/4380 Signed-off-by: parth-gr <partharora1010@gmail.com>	2024-03-22 13:15:50 +00:00
Niels de Vos	991343d9e5	cleanup: do not pass EncodingVersion to `GenerateVolID()` The only encoding version that exists is `1`. There is no need to have multiple constants for that version across different packages. Because there is only one version, `GenerateVolID()` does not really require it, and it can use a default version. If there is a need in the future to support an other encoding version, this can be revisited with a cleaner solution. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-03-18 06:23:28 +00:00
muxuelan	4f04748c87	rbd: support nbd on euler or arm Signed-off-by: muxuelan <muxuelan@cmss.chinamobile.com>	2024-03-15 10:39:50 +00:00
Praveen M	e345b26340	cleanup: refactor functions to accept a context parameter Signed-off-by: Praveen M <m.praveen@ibm.com>	2024-03-12 13:54:19 +00:00
Niels de Vos	3bf5c0e478	cleanup: simplify `rbdGetDeviceList()` The `rbdGetDeviceList()` function uses two very similar types for converting krbd and NBD device information from JSON. There is no need to use this distinction, and callers of `rbdGetDeviceList()` should not need to care about it either. By introducing a `deviceInfo` interface with Get-functions, the `rbdGetDeviceList()` function becomes a little simpler, with a clearly defined API for the returned list. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-01-11 18:00:41 +00:00
Praveen M	50e505619c	deploy: added json field tags for csi config map This commit adds the json field tags for csi config map for encoding and decoding JSON. Signed-off-by: Praveen M <m.praveen@ibm.com>	2023-12-21 17:44:46 +00:00
Jan Nemcik	3443546370	rbd: updated node labels fetching logic node labels are fetched only if controller is running in k8s and is nodeserver Signed-off-by: Jan Nemcik <jan.nemcik@solargis.com>	2023-12-11 10:59:50 +00:00
Riya Singhal	4b5cdd5316	util: addressed a few TODOs This commit replaces string comparison with error codes in a few places. Signed-off-by: Riya Singhal <rsinghal@redhat.com>	2023-11-23 00:55:17 +00:00
Praveen M	4d466843b9	cephfs: add read affinity mount option This commit makes use of crush location labels from node labels to supply `crush_location` and `read_from_replica=localize` options during mount. Using these options, cephfs will be able to redirect reads to the closest OSD, improving performance. Signed-off-by: Praveen M <m.praveen@ibm.com>	2023-11-22 13:13:01 +00:00
Praveen M	c4e373c72f	deploy: support for read affinity options per cluster Implemented the capability to include read affinity options for individual clusters within the ceph-csi-config ConfigMap. This allows users to configure the crush location for each cluster separately. The read affinity options specified in the ConfigMap will supersede those provided via command line arguments. Signed-off-by: Praveen M <m.praveen@ibm.com>	2023-11-08 21:17:00 +00:00
Madhu Rajanna	304462c7cc	cleanup: fix spellcheck errors fixed spellcheck errors caught in CI. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-11-08 12:32:06 +00:00
Madhu Rajanna	9f753889ed	rbd: remove deprecated rbdImageRequiresEncryption remove support for deprecated rbdImageRequiresEncryption case. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-11-04 08:14:51 +00:00
Madhu Rajanna	3ea540bf0f	util: remove deprecated grpc metrics This commit removes the deprecated grpc related code from cephcsi. fixes: #4122 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-11-03 10:04:50 +00:00
Praveen M	0ed7a64191	rbd: update snap RbdImageName in createSnapshot This PR updates the snapshot RbdImageName in `createSnapshot` method. This resolves the incorrect statement logged during snapshot creation. Signed-off-by: Praveen M <m.praveen@ibm.com>	2023-10-03 11:45:03 +00:00
Praveen M	e504987984	rbd: update snap RbdImageName This commit updates the snapshot RbdImageName with the clone RbdImageName before snapshot creation. This will fix the incorrect log statement. Signed-off-by: Praveen M <m.praveen@ibm.com>	2023-09-28 11:51:13 +00:00
HF	5411a69839	rbd: fixed all potential crashing when decoding volume ID failed Signed-off-by: HF <crazytaxii666@gmail.com>	2023-09-06 13:46:22 +00:00
HF	80ad5b6b8f	rbd: fixed csi-rbdplugin crashes when decoding volume ID failed Signed-off-by: HF <crazytaxii666@gmail.com>	2023-09-05 12:08:53 +00:00
Madhu Rajanna	e013cfed15	rbd: fix resync issue During Demote, store the image creation timestamp. During Resync, do the following: * Compare the image creation timestamp stored during the Demote operation with the current creation timestamp; if both are equal and a force resync is requested, issue the resync. * If the image on both sides is not in the unknown state, check last_snapshot_timestamp in the local mirror description; if it is present, send volumeReady as false, otherwise return an error message. If both images are in up+unknown, then send volumeReady as true. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-08-30 09:07:46 +00:00
Rakshith R	98fdadfde7	rbd: do not execute rbd sparsify when volume is in use This commit makes sure sparsify() is not run when the rbd image is in use. Running rbd sparsify too frequently, or while a workload is doing I/O, is not desirable. When an image is in use, fstrim is run; sparsify is run only when the image is not mapped. Signed-off-by: Rakshith R <rar@redhat.com>	2023-07-11 13:48:36 +00:00
Niels de Vos	f60a358007	rbd: do not try to run `resizefs` on an encrypted BlockMode volume When a volume has AccessType=Block and is encrypted with LUKS, a resize of the filesystem on the (decrypted) block-device is attempted. This should not be done, as the application that requested the Block volume is the only authoritative reader/writer of the data. In particular VirtualMachines that use RBD volumes as a disk, usually have a partition table on the disk, instead of only a single filesystem. The `resizefs` command will not be able to resize the filesystem on the block-device, as it is a partition table. When `resizefs` fails during NodeStageVolume, the volume is unstaged and an error is returned. Resizing an encrypted block-device requires `cryptsetup resize` so that the LUKS header on the RBD-image is updated with the correct size. But there is no need to call `resizefs` in this case. Fixes: #3945 Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-07-03 14:54:39 +00:00
riya-singhal31	dbdb9086d8	rbd: migration of replication controller server this commit migrates the replication controller server from internal/rbd and adds it to csi-addons. Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-06-23 06:00:40 +00:00
riya-singhal31	cdaa9264eb	rbd: migration of replication service to csi-addon this commit removes grpc import from replication.go and replaced it with usual errors and passed gRPC responses in csi-addons Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-06-22 11:50:54 +00:00
riya-singhal31	b5e68c810e	rbd: add unit test for ParseEncryptionOpts Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-06-06 22:01:26 +00:00
riya-singhal31	347b4d2885	rbd: remove context where its not being used Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-06-06 22:01:26 +00:00
riya-singhal31	38f5e860e2	rbd: add check for EncryptionTypeNone this commit adds the validation for encryption value as false, and sets the type as none Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-06-06 22:01:26 +00:00
riya-singhal31	92d9785166	cleanup: ErrWaitTimeout is deprecated in k8s 1.27 replaced ErrWaitTimeout with Interrupted Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-06-06 12:21:43 +00:00
Niels de Vos	c968f6407d	build: address `dupword` warnings Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Niels de Vos	b9b8392f71	build: address `errorlint` warning Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Niels de Vos	9201da0502	build: address `gofmt` warnings Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Niels de Vos	a6c14c051f	build: address `golint` warnings Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Niels de Vos	e63ebb73c5	build: address `nlreturn` warnings Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Niels de Vos	53c94efc02	build: address `gocritic` warnings Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Niels de Vos	81218a69f9	build: address `nolintlint` errors from new golangci-lint Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-06-05 04:49:46 +00:00
Liang Zheng	5a079122f4	rbd: can exit early if image-meta.json does not exist Signed-off-by: Liang Zheng <zhengliang0901@gmail.com>	2023-05-02 20:36:24 +00:00
riya-singhal31	304194a0c0	cleanup: migration of volrep to csi-addons This commit moves the volrep logic from internal/rbd to internal/csi-addons/rbd. Signed-off-by: riya-singhal31 <rsinghal@redhat.com>	2023-04-21 13:05:20 +00:00
Niels de Vos	37c8f07ed5	rbd: do not run mkfs on a BlockMode volume Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-03-08 16:26:39 +00:00
Niels de Vos	a4678200e5	rbd: allow setting `mkfsOptions` in the StorageClass Add `mkfsOptions` to the StorageClass and pass them to the `mkfs` command while creating the filesystem on the RBD device. Fixes: #374 Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-03-08 16:26:39 +00:00
Niels de Vos	13cdb08e61	rbd: cleanup passing `mkfs` arguments for NodeStageVolume Storing the default `mkfs` arguments in a map with key per filesystem type makes this a little more modular. It prepares the code for fetching the `mkfs` arguments from the VolumeContext. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2023-03-08 16:26:39 +00:00
Rakshith R	95682522ee	rbd: add capability to automatically enable read affinity This commit makes use of crush location labels from node labels to supply `crush_location` and `read_from_replica=localize` options during rbd map cmd. Using these options, ceph will be able to redirect reads to the closest OSD, improving performance. Signed-off-by: Rakshith R <rar@redhat.com>	2023-02-14 08:29:46 +00:00
Madhu Rajanna	e9e33fb851	cleanup: fix static checks fix SA1019 static check to replace io/utils with os package Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-02-02 14:53:59 +00:00
Madhu Rajanna	e54a97ba85	rbd: discover if StagingTargetPath in NodeExpandVolume The StagingTargetPath is an optional entry in NodeExpandVolumeRequest. We cannot expect it to always be set, yet cephcsi depended on the StagingTargetPath to retrieve some metadata information. This commit checks all the mount refs and identifies the stagingTargetPath by checking whether the image-meta.json file exists. This is a costly operation, as we need to loop through all the mounts and check image-meta.json in each mount, but it happens only if the StagingTargetPath is not set in the NodeExpandVolumeRequest. fixes #3623 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-01-31 08:20:36 +00:00
Madhu Rajanna	d5278bd6c5	rbd: set disableInUseChecks on rbd volume set disableInUseChecks on rbd volume struct as it will be used later to check whether the rbd image is allowed to mount on multiple nodes. fixes: #3604 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2023-01-11 16:24:07 +00:00
Humble Chirammal	71c4ae542c	rebase: remove protobuf dependency locking this commit remove the protobuf dependency locking in the module description. Also, ptypes.TimestampProto is deprecated and this commit make use of the timestamppb.New() for the construction. ParseTime() function has been removed and callers adjusted to the same. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-11-15 00:10:46 +00:00
Madhu Rajanna	d12400aa9c	rbd: unset metadata if setmetadata is false We need to unset the metadata on the clone and restore PVC if the parent PVC was created when setmetadata was set to true and it was set to false when restore and clone pvc was created. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-11-14 14:41:36 +00:00
Humble Chirammal	d70b594946	rbd: remove false error check in getDeviceSize this removed err condition will be always false as error is always nil. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-11-09 15:35:45 +00:00
Rakshith R	8650538b78	rbd: setup encryption if rbdVol exists during CreateVol This commit adds code to set up encryption on an rbdVol being repaired in a follow-up CreateVolume request. This fixes a bug wherein encryption metadata may not have been set in the previous request due to a container restart. Fixes: #3402 Signed-off-by: Rakshith R <rar@redhat.com>	2022-11-07 12:49:18 +00:00
Madhu Rajanna	07e9dede2c	rbd: check volume details from original volumeID Check volume details for the existing volumeID first. If details like the OMAP, RBD image, or pool do not exist, try to use clusterIDMapping to look for the correct information. fixes: #2929 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-11-04 06:32:05 +00:00
Madhu Rajanna	3e1f60244e	rbd: check for empty lastSyncTime Sometimes the JSON unmarshal might succeed and return an empty timestamp. Add a check to make sure the time is not zero. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-11-03 08:10:19 +00:00
Madhu Rajanna	8f25edc888	rbd: return error if last sync time not present As per the csiaddon spec, last sync time is a required parameter in GetVolumeReplicationInfo. If we fail to parse the description, return a not found error message instead of nil, which is an empty response. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-11-03 08:10:19 +00:00
Madhu Rajanna	07aa9dea5c	rbd: update namespace name in rados object If a PV is reattached to a new PVC in a different namespace we need to update the namespace name in the rados object. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-28 15:50:01 +00:00
Madhu Rajanna	019628c8c2	rbd: update namespace name in metadata If a PV is reattached to a new PVC in a different namespace we need to update the namespace name in the rbd image metadata. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-28 15:50:01 +00:00
Madhu Rajanna	848e3ee557	rbd: return abnormal in NodeGetVolumeStats When we do stat on the targetpath and there is an error, we can check whether it is due to corruption. If yes, cephcsi can return abnormal in NodeGetVolumeStats so that the consumer (CO/admin) can detect it and take further action. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-26 09:40:22 +00:00
Madhu Rajanna	f12fa3ee56	rbd: return GRPC error from GRPC method GRPC methods should only return GRPC errors if any error occurs. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-19 08:00:42 +00:00
Marcel Lauhoff	dc7ba684e3	rbd: Use EncryptionTypeNone Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	1f1504479c	rbd: Add context to fscrypt errors Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	3e3af4da18	rbd: support file encrypted snapshots Support fscrypt on RBD snapshots Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	82d92aab4a	rbd: Add volume journal encryption support Add fscrypt support to the journal to support operations like snapshotting. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	a7ea12eb8e	rbd: Handle encryption type default at a more meaningful place Different places have different meaningful fallback. When parsing from user we should default to block, when parsing stored config we should default to invalid and handle that as an error. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	1fa842277a	rbd: fscrypt file encryption support Integrate basic fscrypt functionality into RBD initialization. To activate file encryption instead of block introduce the new 'encryptionType' storage class key. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	ce9fbb3474	rbd: Rename encryption to blockEncryption prep for fscrypt In preparation of fscrypt support for RBD filesystems, rename block encryption related function to include the word 'block'. Add struct fields and IsFileEncrypted. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Marcel Lauhoff	fe4821435e	util: Make encryption passphrase size a parameter fscrypt support requires keys longer than 20 bytes. As a preparation, make the new passphrase length configurable, but default to 20 bytes. Signed-off-by: Marcel Lauhoff <marcel.lauhoff@suse.com>	2022-10-17 17:33:52 +00:00
Madhu Rajanna	69eb6e40dc	rbd: return GRPC error message The error message return from the GRPC should be of GRPC error messages only not the normal go errors. This commits returns GRPC error if setAllMetadata fails. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-17 15:17:29 +00:00
Madhu Rajanna	01d4a614c3	rbd: delete volume if setallmetadata fails If any operations fails after the volume creation we will cleanup the omap objects, but it is missing if setAllMetadata fails. This commits adds the code to cleanup the rbd image if metadata operation fails. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-17 15:17:29 +00:00
Niels de Vos	b7703faf37	util: make inode metrics optional in FilesystemNodeGetVolumeStats() CephFS does not have a concept of "free inodes", inodes get allocated on-demand in the filesystem. This confuses alerting managers that expect a (high) number of free inodes, and warnings get produced if the number of free inodes is not high enough. This causes alerts to always get reported for CephFS. To prevent the false-positive alerts from happening, the NodeGetVolumeStats procedure for CephFS (and CephNFS) will not contain inodes in the reply anymore. See-also: https://bugzilla.redhat.com/2128263 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-10-13 19:02:47 +00:00
Madhu Rajanna	71e5b3f922	rbd: remove dummy image workaround To address the problem that snapshot schedules are triggered for volumes that are promoted, a dummy image was disabled/enabled for replication. This was done as a workaround, because the promote operation was not triggering the schedules for the image being promoted. The bugs related to the same have been fixed in RBD mirroring functionality and hence the workaround #2656 can be removed from the code base. ceph tracker https://tracker.ceph.com/issues/53914 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-10-10 08:22:10 +00:00
Yati Padia	36b061d426	rbd: get description from remote status This commit gets the description from remote status instead of local status. Local status doesn't have ',' due to which we get array index out of range panic. Fixes: #3388 Signed-off-by: Yati Padia <ypadia@redhat.com> Co-authored-by: shyam Ranganathan <srangana@redhat.com>	2022-09-14 12:06:01 +00:00
yati1998	b19705f260	rbd: implements getVolumeReplicationInfo This commit implements getVolumeReplicationInfo to get the last sync time and update it in volume replication CR. Signed-off-by: yati1998 <ypadia@redhat.com>	2022-09-13 14:17:10 +00:00
Madhu Rajanna	71dbc7dbb4	rbd: map only primary image If the image is mirroring enabled and primary consider it for mapping, if the image is mirroring enabled but not primary yet. return error message until the image is marked as primary. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-09-06 10:40:12 +00:00
Rakshith R	19e4146fab	rbd: add replication capability & service to csiaddons server csi-addons server will advertise replication capability and replication service will run with csi-addons server too. Signed-off-by: Rakshith R <rar@redhat.com>	2022-08-18 08:19:20 +00:00
Shyamsundar Ranganathan	c2280011d1	rbd: Report remote peer readiness if Up and status.Unknown Current code uses an !A && !B condition incorrectly to test A:Up and B:status for a remote peer image. This should be !A || !B as we require both conditions to be in the specified state (Up: true, and status Unknown). This is corrected by this commit, and further fixes: - check and return ready only when a remote site is found in the status output - check if all peer sites are ready, if multiple are found and return ready appropriately Signed-off-by: Shyamsundar Ranganathan <srangana@redhat.com>	2022-08-09 05:32:15 +00:00
Madhu Rajanna	8d7b6ee59f	rbd: consider mirror daemon state for ResyncVolume During ResyncVolume we check if the image is in an error state, and we resync. After resync, the image will move to either the `Error` or the `Resyncing` state. And if the image is in the above two conditions, we will return a successful response and Ready=false so that the consumer can wait until the volume is ready to use. If the image is in any other state we return an error message to indicate the syncing is not going on. The whole resync and image state change depends on the rbd mirror daemon. If the mirror daemon is not running, the image can be in Resyncing or Unknown state. Ramen marks the volume replication as secondary, and once the resync starts, it will delete the volume replication CR as a cleanup process. As we don't have a check for the rbd mirror daemon, we are returning a resync success response and Ready=false. Due to this false response Ramen is assuming the resync started and deleted the volume replication CR, and because of this, the cluster goes into a bad state and needs manual intervention. fixes #3289 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-08-08 13:26:15 +00:00
Niels de Vos	83df1eae53	rebase: k8s.io/mount-utils/IsNotMountPoint() is deprecated IsNotMountPoint() is deprecated and Mounter.IsMountPoint() is recommended to be used instead. Reported-by: golangci/staticcheck Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-08-04 09:53:07 +00:00
Niels de Vos	3a200b6976	rbd: use IsLikelyNotMountPoint() to prevent systemd log messages Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-08-04 09:53:07 +00:00
Humble Chirammal	bc9ad3d9f1	rbd: add dummy attacher implementation previously, it was a requirement to have attacher sidecar for CSI drivers and there had an implementation of dummy mode of operation. However skipAttach implementation has been stabilized and the dummy mode of operation is going to be removed from the external-attacher. Considering this driver work on volumeattachment objects for NBD driver use cases, we have to implement dummy controllerpublish and unpublish and thus keep supporting our operations even in absence of dummy mode of operation in the sidecar. This commit make a NOOP controller publish and unpublish for RBD driver. CephFS driver does not require attacher and it has already been made free from the attachment operations. Ref# https://github.com/ceph/ceph-csi/pull/3149 Ref# https://github.com/kubernetes-csi/external-attacher/issues/226 Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-08-03 00:25:49 +00:00
Madhu Rajanna	8c5563a9bc	rbd: remove checkHealthyPrimary check After Failover of workloads to the secondary cluster when the primary cluster is down, RBD Image is not marked healthy, and VR resources are not promoted to the Primary, In VolumeReplication, the `CURRENT STATE` remains Unknown and doesn't change to Primary. This happens because the primary cluster went down, and we have force promoted the image on the secondary cluster. and the image stays in up+stopping_replay or could be any other states. Currently assumption was that the image will always be `up+stopped`. But the image will be in `up+stopped` only for planned failover and it could be in any other state if its a forced failover. For this reason, removing checkHealthyPrimary from the PromoteVolume RPC call. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-07-27 09:04:27 +00:00
Niels de Vos	011d4fc81c	cleanup: create k8s.io/mount-utils Mounter only once Recently the k8s.io/mount-utils package added more runtime detection. When creating a new Mounter, the detection is run every time. This is unfortunate, as it logs a message like the following: ``` mount_linux.go:283] Detected umount with safe 'not mounted' behavior ``` This message might be useful, so it is probably good to keep it. In Ceph-CSI there are various locations where Mounter instances are created. Moving that to the DefaultNodeServer type reduces it to a single place. Some utility functions need to accept the additional parameter too, so that has been modified as well. See-also: kubernetes/kubernetes#109676 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-07-21 07:14:43 +00:00
Benoît Knecht	507844c9b1	rbd: Use rados namespace when getting clone depth When the Ceph user is restricted to a specific namespace in the pool, it is crucial that every interaction with the cluster is done within that namespace. This wasn't the case in `getCloneDepth()`. This issue was causing snapshot creation to fail with > Failed to check and update snapshot content: failed to take snapshot of the > volume X: "rpc error: code = Internal desc = rbd: ret=-1, Operation not > permitted" Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2022-07-07 22:20:29 +00:00
Niels de Vos	14ba1498bf	util: reduce systemd related errors while mounting There are regular reports that identify a non-error as the cause of failures. The Kubernetes mount-utils package has detection for systemd based environments, and if systemd is unavailable, the following error is logged: Cannot run systemd-run, assuming non-systemd OS systemd-run output: System has not been booted with systemd as init system (PID 1). Can't operate. Failed to create bus connection: Host is down, failed with: exit status 1 Because of the `failed` and `exit status 1` error message, users might assume that the mounting failed. This does not need to be the case. The container-images that the Ceph-CSI projects provides, do not use systemd, so the error will get logged with each mount attempt. By using the newer MountSensitiveWithoutSystemd() function from the mount-utils package where we can, the number of confusing logs get reduced. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2022-07-04 10:02:54 +00:00
Yati Padia	5c40f1ef33	rbd: remove the clone in case of failure This commit removes the clone incase unsetAllMetadata or copyEncryptionConfig or expand fails for createVolumeFromSnapshot and CreateSnapshot. It also removes the clone in case of any failure in createCloneFromImage. issue: #3103 Signed-off-by: Yati Padia <ypadia@redhat.com>	2022-06-30 05:50:16 +00:00
Prasanna Kumar Kalever	9fa3c8382b	cleanup: reduce struct padding internal/rbd/rbd_util.go:89:15: struct of size 312 bytes could be of size 304 bytes: `` struct{ RbdImageName string, ImageID string, VolID string, Monitors string, JournalPool string, Pool string, RadosNamespace string, ClusterID string, RequestName string, NamePrefix string, ParentName string, ParentPool string, ClusterName string, Owner string, VolSize int64, StripeCount uint64, StripeUnit uint64, ObjectSize uint64, ImageFeatureSet github.com/ceph/go-ceph/rbd.FeatureSet, encryption github.com/ceph/ceph-csi/internal/util.VolumeEncryption, CreatedAt google.golang.org/protobuf/types/known/timestamppb.Timestamp, conn github.com/ceph/ceph-csi/internal/util.ClusterConnection, ioctx github.com/ceph/go-ceph/rados.IOContext, Primary bool, EnableMetadata bool, } `` (maligned) type rbdImage struct { ^}` make: *** [Makefile:118: go-lint] Error 1 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-06-28 19:12:53 +00:00
Prasanna Kumar Kalever	caf4090657	rbd: provide option to disable setting metadata on rbd images As we added support to set the metadata on the rbd images created for the PVC and volume snapshot, by default metadata is set on all the images. As we have seen we are hitting issues#2327 a lot of times with this, we start to leave a lot of stale images. Currently, we rely on `--extra-create-metadata=true` to decide whether to set the metadata or not; we cannot set this option to false to disable setting metadata because we use it for encryption too. This change provides an option to disable setting the image metadata when starting cephcsi. Fixes: #3009 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-06-28 19:12:53 +00:00
Madhu Rajanna	8a47904e8f	rbd: add unit test for checkHealthyPrimary Removed the code in checkHealthyPrimary which makes the ceph call, passing it as input now. Added unit test for checkHealthyPrimary function Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-06-28 13:17:11 +00:00
Madhu Rajanna	53e76fab69	rbd: fix checkHealthyPrimary to consider up+stopped state We need to check that the image is in the up+stopped state, not in just any one of those states; for that we need to use an OR check, not an AND check. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-06-28 13:17:11 +00:00
Madhu Rajanna	704cb5c941	revert: rbd: consider remote image health for primary When the image is force promoted to primary on the cluster, the remote image might not be in replaying state due to the split-brain state. This PR reverts the commit `c3c87f2ef3`, which we added to check the remote image status. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-06-28 13:17:11 +00:00
Prasanna Kumar Kalever	1da446d2f2	rbd: healer detect Kubernetes version for right StagingTargetPath Kubernetes 1.24 and newer use a different path for staging the volume. That means the CSI-driver is requested to mount the volume at another location compared to previous versions of Kubernetes. CSI-drivers implementing the volumeHealer must receive the correct path; otherwise, after a nodeplugin restart, the NBD mounts will bail out when attempting the NodeStageVolume() call and return an error. See-also: kubernetes/kubernetes#107065 Fixes: #3176 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2022-06-24 12:23:29 +00:00
Madhu Rajanna	3acaa018db	rbd: issue resync only if the force flag is set During failover we demote the volume on the primary. As the image is not yet promoted on the remote cluster, RBD reports spurious split-brain errors; the Cephcsi resync would then attempt to resync from the "known" secondary, and that would cause data loss. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-06-23 13:28:18 +00:00
Robert Vasek	0807fd2e6c	journal: added csi.volume.backingsnapshotid image attribute Signed-off-by: Robert Vasek <robert.vasek@cern.ch>	2022-06-16 09:44:27 +00:00
Madhu Rajanna	4b57cc3ec5	rbd: add support for rbd striping RBD supports creating rbd images with object size, stripe unit and stripe count to support striping. This PR adds the support for the same. More details about striping at https://docs.ceph.com/en/quincy/man/8/rbd/#striping fixes: #3124 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2022-06-09 18:59:00 +00:00
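
The condition fix described in commit c2280011d1 above (a peer is not ready if it is not up OR its status is not unknown) can be illustrated with a minimal Go sketch. The `peerStatus` type and the functions `peerReady` and `allPeersReady` are hypothetical names invented for this illustration; they are not taken from the ceph-csi code base, and the real implementation may differ.

```go
package main

import "fmt"

// peerStatus is a hypothetical, simplified view of a remote peer's
// mirroring status. It is not the ceph-csi or go-ceph type.
type peerStatus struct {
	Up    bool   // A: the peer reports as up
	State string // B: the peer's mirror state, e.g. "unknown"
}

// peerReady reports whether a single peer is up AND in the unknown state.
// A peer is rejected when EITHER condition fails: !A || !B == !(A && B).
// The buggy form (!A && !B) would reject only peers failing both checks.
func peerReady(p peerStatus) bool {
	if !p.Up || p.State != "unknown" {
		return false
	}
	return true
}

// allPeersReady returns true only if at least one remote peer was found
// and every peer found is ready.
func allPeersReady(peers []peerStatus) bool {
	if len(peers) == 0 {
		return false // no remote site in the status output
	}
	for _, p := range peers {
		if !peerReady(p) {
			return false
		}
	}
	return true
}

func main() {
	peers := []peerStatus{
		{Up: true, State: "unknown"},
		{Up: false, State: "unknown"},
	}
	fmt.Println(allPeersReady(peers)) // prints "false": the second peer is down
}
```

With the `&&` form, a peer that is up but in some other state (for example a hypothetical "replaying") would pass the check, because only one of the two negated conditions holds.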


609 Commits