ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-06-03 04:16:42 +00:00

Author	SHA1	Message	Date
Niraj Yadav	2801f153d2	cephfs: remove extraneous creation of credentials `ControllerExpandVolume` creates the credentials from secrets but never actually uses it for anything. The secrets map is passed on to `NewVolumeOptionsFromVolID` which does the same check again. This patch removes the extraneous step. Signed-off-by: Niraj Yadav <niryadav@redhat.com>	2024-11-27 14:37:51 +00:00
Nikhil-Ladha	98cf0780e1	cephfs: log clone progress log cephfs clone progress report during cephfs clone operation Signed-off-by: Nikhil-Ladha <nikhilladha1999@gmail.com>	2024-11-22 08:04:50 +00:00
Madhu Rajanna	00d252e4ac	rbd: use os.Remove to remove directory Using os.RemoveAll will remove everything in the directory. After the unmount, we should use os.Remove to remove only the empty directory. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-21 10:18:56 +00:00
Madhu Rajanna	cd09266870	cephfs: use os.Remove to remove directory Using os.RemoveAll will remove everything in the directory. After the unmount, we should use os.Remove to remove only the empty directory. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-21 10:18:56 +00:00
Madhu Rajanna	7cfeae579f	cephfs: take lock on targetpath on node operation We should not depend on the CO to serialize requests. Instead, we need our own internal locks to ensure that we don't run concurrent operations for the same request. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-21 10:18:56 +00:00
Madhu Rajanna	b6bd8ca71a	rbd: take lock on targetpath during node operation We should not depend on the CO to serialize requests. Instead, we need our own internal locks to ensure that we don't run concurrent operations for the same request. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-21 10:18:56 +00:00
Rakshith R	d457840d21	rbd: set depthToAvoidFlatten to 3 during PVC-PVC clone During PVC-PVC clone creation, parent of the datasource image is flattened after checking for clone depth. We need to account for data source image as well since we're calculating depth from the parent image. depthToAvoidFlatten = 3(datasource image + temp + final clone) Signed-off-by: Rakshith R <rar@redhat.com>	2024-11-19 11:34:34 +00:00
Rakshith R	eea64fe1f9	rbd: remove checkFlatten() function CephCSI should not flatten an image that can be mounted for use by the user. `checkFlatten()` was called in a recovery code flow of a PVC restored from a snapshot and was missed while refactoring in https://github.com/ceph/ceph-csi/pull/2900 refer: #2900 Signed-off-by: Rakshith R <rar@redhat.com>	2024-11-19 11:34:34 +00:00
Niels de Vos	d98516e9d8	rbd: add locking for VolumeGroupSnapshot operations Add VolumeGroupLocks in the CSI Controller Server so that operations are protected against concurrent requests for the same VolumeGroupSnapshot. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-12 09:28:30 +00:00
Niels de Vos	f3d40f9e5a	rbd: cleanup inconsistent state in `reserveSnap()` after a failure `reserveSnap()` can potentially fail halfway through, in that case it needs to undo the snapshot reservation and restore modified attributes of the snapshot. Fixes: #4945 Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-11 13:39:05 +00:00
Niels de Vos	cea8bf8110	rbd: set SnapshotGroupID on each Snapshot of a VolumeGroupSnapshot Without the SnapshotGroupID in the Snapshot object, Kubernetes CSI does not know that the Snapshot belongs to a group. In that case, it allows the deletion of the Snapshot, which should be denied. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	ec1e7a4ee0	rbd: expose the GroupControllerService When the GroupSnapGetInfo go-ceph function is supported by librbd, the Group Controller Service and VolumeGroupSnapshot capabilities can be exposed to the Container Orchestrator. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	e34dceff27	rbd: implement CSI Group Controller Server Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	e011e74b9d	rbd: fix snapshot deletion by resolving image names correctly When creating a Snapshot with the new NewSnapshotByID() function, the name of the RBD-image that is created is the same as the name of the Snapshot. The `RbdImageName` points to the name of parent image, which causes deleting the Snapshot to delete the parent image instead. Correcting the `RbdImageName` and setting it to the `RbdSnapName` makes sure that upon deletion, the Snapshot RBD-image is removed, and not the parent image. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	fdccba1f33	rbd: add Manager.GetVolumeGroupSnapshotByName The Group Controller Server may need to fetch a VolumeGroupSnapshot that was statically provisioned. In that case, only the name of the VolumeGroupSnapshot is known and should be resolved to an object. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	ad381c4ff0	rbd: implement Manager.GetVolumeGroupSnapshotByID The GetVolumeGroupSnapshotByID function makes it possible to get a VolumeGroupSnapshot object from the Manager by passing a request-id. This makes it simple for the Group Controller Server to check if a VolumeGroupSnapshot already exists, so it is not needed to try and re-create an existing one. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	7563f4285d	rbd: add manager.CreateVolumeGroupSnapshot() Implement the CreateVolumeGroupSnapshot for the rbd.Manager. A Group Controller Server can use the rbd.Manager to create VolumeGroupSnapshots in an easy and idempotent way. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	9bea3feff1	rbd: add manager GetSnapshotByID and SnapshotResolver interface A (CSI) VolumeGroupSnapshot object contains references to Snapshot IDs (or CSI Snapshot handles). In order to work with a VolumeGroupSnapshot struct, the Snapshot IDs need to be resolved into rbdSnapshot structs. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	455a90e9f4	rbd: add VolumeGroupSnapshot type The VolumeGroupSnapshot type will be used by the rbd.Manager to create, inspect and delete VolumeGroupSnapshots. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	efb7bccaea	rbd: add VolumeGroup.CreateSnapshots() implementation When the rbd.Manager creates a VolumeGroupSnapshot, each RBD-snapshot that is created as part of the RBD-group needs to be cloned into its own RBD-image that will be used as a CSI Snapshot. The VolumeGroup.CreateSnapshots() creates the RBD-group snapshot and returns a list of the Snapshot structs. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	20fadf2016	rbd: add `rbdVolume.NewSnapshotByID` to clone images by RBD snapshot-id The NewSnapshotByID() function makes it possible to clone a new Snapshot from an existing RBD-image and the ID of an RBD-snapshot on that image. This will be used by the VolumeGroupSnapshot feature, where the ID of an RBD-snapshot is obtained for the RBD-snapshot on the RBD-images. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	9808408340	rbd: pass CSI-drivername to volume group instead of journal instance Each object is responsible for maintaining a connection to the journal. By sharing a single journal, cleanup of objects becomes more complex as the journal is used in deferred functions and only the last should destroy the journal connection resources. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	29bf5797b0	rbd: add `.requestName` to the `commonVolumeGroup` struct Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	4b13e9132b	rbd: have `GetVolumeGroup()` return an empty volume group if it was not found Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	5d5171c7d7	journal: store `csi.groupid` for snapshots Commit 95733b3a9 introduced the `StoreGroupID()` function, but that unfortunately set an empty key in the journal. Passing the `csiGroupIDKey` key (with value `csi.groupid`) caused setting `csi.csi.groupid` as a key. Reading the value back with the right `csi.groupid` key always returned an empty value. Fixes: 95733b3a9 "journal: add option to store the groupID" Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Niels de Vos	6d88e0a4c7	rbd: close the RBD-image after adding it to a VolumeGroup When the image is not closed, it keeps a watch open. This prevents the CSI Controller from deleting the Volume, as there is still a user of it. Fixes: f9ab14e826 "rbd: check if an image is part of a group before adding it" Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-11-06 11:37:44 +00:00
Madhu Rajanna	b4592a55eb	rbd: parse IP address The address we get from Ceph contains the IP in the format 10.244.0.1:0/2686266785, and we need to extract the client IP from it. We already have a helper to extract it; this makes the helper more generic so that it can be reused by multiple packages in the fence controller. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-06 09:48:45 +00:00
Madhu Rajanna	facf805941	rbd: implement GetFenceClients Implement GetFenceClients, which connects to the Ceph cluster and returns the Ceph clusterID and the client address that is used for the rados connection. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-06 09:48:45 +00:00
Madhu Rajanna	ba8c5a359c	util: add GetAddrs method Add GetAddrs to get the client address of the rados connection, which is helpful for NetworkFencing. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-06 09:48:45 +00:00
Madhu Rajanna	fdc74973d8	rbd: register GET_CLIENTS_TO_FENCE caps Register the Capability_NetworkFence_GET_CLIENTS_TO_FENCE capability and start a NetworkFence controller as part of the rbd nodeplugin. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-11-06 09:48:45 +00:00
Niraj Yadav	1c02e69ba4	rbd: Add timeout for cryptsetup commands This PR modifies the execCryptSetupCommand so that the process is killed in the event of a lock timeout. Useful in cases where the volume lock is released but the command is still running. Signed-off-by: Niraj Yadav <niryadav@redhat.com>	2024-11-05 11:39:59 +00:00
Praveen M	86759d4653	cephfs: support omap store in radosnamespace This commit adds support for storing the CephFS omap data in a namespace specified in the ceph-csi-config ConfigMap under the cephFS.radosNamespace field. If the radosNamespace is not set, the default radosNamespace will be used, i.e., csi. Signed-off-by: Praveen M <m.praveen@ibm.com>	2024-10-21 14:11:27 +00:00
Praveen M	c7f41cf84b	util: add GetCephFSRadosNamespace method This commit adds the `GetCephFSRadosNamespace` util method that returns the `RadosNamespace` specified in the ceph-csi-config ConfigMap under cephFS.radosNamespace. If not specified, the method returns the default RadosNamespace, i.e., csi. Signed-off-by: Praveen M <m.praveen@ibm.com>	2024-10-21 14:11:27 +00:00
Niels de Vos	a51a6ae43a	rbd: add types.Snapshot interface The rbdSnapshot/rbdImage object implements all functions for a useful Snapshot interface. The rbd.Manager will be able to use this for providing VolumeGroupSnapshot support. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-17 16:30:33 +00:00
Niels de Vos	f885c77f4e	rbd: use `GetCreationTime()` to build the CSI-Snapshot object Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-17 16:30:33 +00:00
Niels de Vos	6df173dbf3	journal: only destroy the connection if it is set Prevent re-use of a destroyed connection by setting it to `nil`. This way it is also safe to call `Destroy()` multiple times without causing a panic. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-14 14:54:29 +00:00
Niels de Vos	e154eae732	cleanup: use `err` and `target` in recommended order to `errors.Is()` The documentation has `errors.Is(err, target)`, so use this as the order of the parameters. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-14 07:29:12 +00:00
Niels de Vos	3802dd2c2c	rbd: add feature check to see if GroupSnapGetInfo is available The go-ceph rbd package provides the GroupSnapGetInfo function, but it may return ErrUnsupported when called. Returning this error after advertising the support for VolumeGroupSnapshot seems ugly. In order to advertise support for VolumeGroupSnapshot, SupportsGroupSnapGetInfo() can be used, which detects the required C function of librbd. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-10 15:45:47 +00:00
Niels de Vos	d33e6b14fe	rbd: validate IOContext before getting the list of trashed images `ensureImageCleanup()` can cause a panic when an image was deleted, but the journal still contained a reference. By opening the IOContext before using, an error may be returned instead of a panic when using a `nil` or freed IOContext. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-04 11:04:22 +00:00
Niels de Vos	10076ca11f	rbd: use the new go-ceph rbd.ErrExist for checking rbd.GroupCreate() The go-ceph rbd.GroupCreate() now returns ErrExist in case the group that is created already exists. The previous check only ever matched the string comparison, which is prone to errors in case the contents are modified by go-ceph. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-10-04 09:00:23 +00:00
Madhu Rajanna	88b964fe18	rbd: consider ErrPermissionDenied for vol In case of RDR with restricted access, the Ceph user will not have access to all the objects or all the pools where the mapping exists. This commit adds a check to continue to get the volume if there is a permission error. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2024-10-03 08:40:07 +00:00
Niels de Vos	2d82cebfeb	rbd: move repairImageID() from rbdVolume struct to rbdImage The `repairImageID()` function is useful for the `rbdSnapshot` objects as well. Move it to the `rbdImage` struct that is the base for both `rbdVolume` and `rbdSnapshot`. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-09-26 18:02:22 +00:00
Niels de Vos	f2bc1c674b	rbd: replace Manager.DeleteVolumeGroup() by VolumeGroup.Delete() There is no need for the `Manager.DeleteVolumeGroup()` function as `VolumeGroup.Delete()` should cover everything too. By moving the `.Delete()` functionality of removing the group from the journal to the shared `commonVolumeGroup` type, a volume group snapshot can use it as well. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-09-26 13:59:21 +00:00
Nikhil-Ladha	01a0ec2d8c	util: use protobuf encoding for core k8s apis For core K8s API objects like Pods, Nodes, etc., we can use protobuf encoding which reduces CPU consumption related to (de)serialization, reduces overall latency of the API call, reduces memory footprint, reduces the amount of work performed by the GC and results in quicker propagation of objects to event handlers of shared informers. Signed-off-by: Nikhil-Ladha <nikhilladha1999@gmail.com>	2024-09-26 11:52:21 +00:00
Niels de Vos	8c252d58ea	rbd: prevent re-use of destroyed resources When `.Destroy()` is called on an rbdImage (or rbdVolume or rbdSnapshot), the IOContext, Connection and other attributes are invalid. When using a destroyed resource that points to an object that was allocated through librbd, the process most likely ends with a panic. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-09-26 09:37:21 +00:00
yati1998	dbd8462bcc	cephfs: correct error code for volumegroupsnapshot this commit corrects the error code used by groupvolumelocks to acquire snapshots Signed-off-by: yati1998 <ypadia@redhat.com>	2024-09-24 15:13:12 +00:00
yati padia	29aecd345f	cephfs: return correct error msg Return SnapshotOperationAlreadyExistsFmt instead of VolumeOperationAlreadyExistsFmt in case of a delete snapshot operation. Signed-off-by: yati1998 <ypadia@redhat.com>	2024-09-23 14:36:19 +00:00
Robert Vasek	7a727c2a43	util: added logs for slow gRPC calls This commit adds a gRPC middleware that logs calls that keep running after their deadline. Adds --logslowopinterval cmdline argument to pass the log rate. Signed-off-by: Robert Vasek <robert.vasek@clyso.com>	2024-09-20 08:55:17 +00:00
Niels de Vos	05d501a728	rbd: prevent panic when using rbdImage that is not connected When an `rbdVolume` or `rbdSnapshot` is not connected with credentials to the Ceph cluster, operations may try to get the IOContext which then causes a panic. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-09-18 07:09:12 +00:00
Niels de Vos	42fc0b6bce	rbd: rename `setImageOptions()` to `constructImageOptions()` A function called `setImageOptions()` is expected to set the passed options on the volume. However, the passed options parameter is only filled with the options that should get set on the RBD-image at the time of creation. The naming of the function and its parameter is confusing. Rename the function to `constructImageOptions()` and return the ImageOptions to make it easier to understand. Signed-off-by: Niels de Vos <ndevos@ibm.com>	2024-09-12 10:31:49 +00:00
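
Commits 00d252e4ac and cd09266870 switch from os.RemoveAll to os.Remove when cleaning up a directory after unmounting. The reason can be illustrated with a small standalone Go sketch. It is not code from ceph-csi: the function name and the scratch directory layout are made up for the example. It only shows that os.Remove refuses to delete a directory that still contains data, whereas os.RemoveAll would have deleted the contents as well.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// removeNonEmptyTarget builds a scratch directory that stands in for a
// target path which still holds data, then tries to delete it with os.Remove.
// It reports whether os.Remove refused and whether the file is still there.
// All names are hypothetical; this is not taken from ceph-csi.
func removeNonEmptyTarget() (bool, bool, error) {
	base, err := os.MkdirTemp("", "targetpath-")
	if err != nil {
		return false, false, err
	}
	// Clean up the scratch area itself; RemoveAll is fine for a fixture.
	defer os.RemoveAll(base)

	target := filepath.Join(base, "mount")
	if err := os.Mkdir(target, 0o750); err != nil {
		return false, false, err
	}
	leftover := filepath.Join(target, "data.txt")
	if err := os.WriteFile(leftover, []byte("user data"), 0o600); err != nil {
		return false, false, err
	}

	// os.Remove deletes a file or an empty directory only, so this call fails
	// and leaves the directory contents untouched.
	removeErr := os.Remove(target)
	_, statErr := os.Stat(leftover)

	return removeErr != nil, statErr == nil, nil
}

func main() {
	refused, survived, err := removeNonEmptyTarget()
	if err != nil {
		fmt.Println("setup failed:", err)
		return
	}
	fmt.Println("os.Remove refused:", refused, "data survived:", survived)
}
```

If an unmount silently failed, the error from os.Remove surfaces the problem instead of deleting whatever is still reachable under the path.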
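
Commits 7cfeae579f and b6bd8ca71a add internal locking on the target path so that node operations do not rely on the CO to serialize requests. A minimal sketch of that idea, assuming a try-lock keyed by path, could look as follows. The `pathLocks` type and its methods are hypothetical names chosen for this example and may differ from what ceph-csi actually uses.

```go
package main

import (
	"fmt"
	"sync"
)

// pathLocks is a hypothetical set of per-path try-locks. A request that finds
// its path already locked is rejected instead of being queued.
type pathLocks struct {
	mu   sync.Mutex
	held map[string]struct{}
}

func newPathLocks() *pathLocks {
	return &pathLocks{held: make(map[string]struct{})}
}

// TryAcquire returns true if the caller now owns the lock for path, and
// false if another operation on the same path is still in progress.
func (l *pathLocks) TryAcquire(path string) bool {
	l.mu.Lock()
	defer l.mu.Unlock()
	if _, busy := l.held[path]; busy {
		return false
	}
	l.held[path] = struct{}{}
	return true
}

// Release frees the lock for path so that a later request can proceed.
func (l *pathLocks) Release(path string) {
	l.mu.Lock()
	defer l.mu.Unlock()
	delete(l.held, path)
}

func main() {
	locks := newPathLocks()
	target := "/var/lib/kubelet/pods/example/mount" // made-up path

	fmt.Println("first request acquired:", locks.TryAcquire(target))
	fmt.Println("concurrent request acquired:", locks.TryAcquire(target))
	locks.Release(target)
	fmt.Println("after release acquired:", locks.TryAcquire(target))
}
```

A caller would typically return an "operation already in progress" style error when TryAcquire reports false, and defer Release once it has acquired the lock.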

1290 Commits
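
Commit b4592a55eb mentions that Ceph reports client addresses in the form 10.244.0.1:0/2686266785 and that the client IP has to be extracted from it. The sketch below shows one way to do that with the Go standard library. The `clientIP` function is a hypothetical helper written for this example, not the helper from ceph-csi, and it assumes the address has the shape `ip:port/nonce`.

```go
package main

import (
	"fmt"
	"net"
	"strings"
)

// clientIP extracts the IP from an address shaped like "ip:port/nonce".
// Hypothetical helper for illustration; the real ceph-csi helper may differ.
func clientIP(addr string) (string, error) {
	// Drop the "/nonce" suffix if it is present.
	hostPort, _, _ := strings.Cut(addr, "/")

	// SplitHostPort also handles bracketed IPv6 such as "[fd00::1]:0".
	host, _, err := net.SplitHostPort(hostPort)
	if err != nil {
		return "", fmt.Errorf("parsing %q: %w", addr, err)
	}
	if net.ParseIP(host) == nil {
		return "", fmt.Errorf("parsing %q: %q is not an IP address", addr, host)
	}

	return host, nil
}

func main() {
	ip, err := clientIP("10.244.0.1:0/2686266785")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("client IP:", ip)
}
```

Validating the result with net.ParseIP keeps malformed input from being passed on as a fencing target.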