ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2024-12-23 13:30:23 +00:00

Author	SHA1	Message	Date
Humble Chirammal	8ea495ab81	rbd: skip volumeattachment processing if pv marked for deletion if the volumeattachment has been fetched but marked for deletion the nbd healer dont want to process further on this pv. This patch adds a check for pv is marked for deletion and if so, make the healer skip processing the same Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-08-26 15:04:19 +00:00
Niels de Vos	6d00b39886	cleanup: move log functions to new internal/util/log package Moving the log functions into its own internal/util/log package makes it possible to split out the humongous internal/util packages in further smaller pieces. This reduces the inter-dependencies between utility functions and components, preventing circular dependencies which are not allowed in Go. Updates: #852 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-08-26 09:34:05 +00:00
Prasanna Kumar Kalever	4f40213d8e	rbd: fix rbd-nbd io-timeout to never abort With the tests at CI, it kind of looks like that the IO is timing out after 30 seconds (default with rbd-nbd). Since we have tweaked reattach-timeout to 300 seconds at ceph-csi, we need to explicitly set io-timeout on the device too, as it doesn't make any sense to keep io-timeout < reattach-timeout Hence we set io-timeout for rbd nbd to 0. Specifying io-timeout 0 tells the nbd driver to not abort the request and instead see if it can be restarted on another socket. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com> Suggested-by: Ilya Dryomov <idryomov@redhat.com>	2021-08-24 17:09:09 +00:00
Prasanna Kumar Kalever	3bf17ade7a	doc: update code comments about available timeout options Adding some code comments to make them readable and easy to understand. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-08-24 17:09:09 +00:00
Prasanna Kumar Kalever	ea3def0db2	rbd: remove per volume rbd-nbd logfiles on detach - Update the meta stash with logDir details - Use the same to remove logfile on unstage/unmap to be space efficient Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-08-24 07:15:30 +00:00
Prasanna Kumar Kalever	d67e88ccd0	cleanup: embed args into struct and pass it to detachRBDImageOrDeviceSpec Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-08-24 07:15:30 +00:00
Prasanna Kumar Kalever	474100c1f1	rbd: add a unit test for getCephClientLogFileName() Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-08-24 07:15:30 +00:00
Prasanna Kumar Kalever	682b3a980b	rbd: rbd-nbd logging the ceph-CSI way - One logfile per device/volume - Add ability to customize the logdir, default: /var/log/ceph Note: if user customizes the hostpath to something else other than default /var/log/ceph, then it is his responsibility to update the `cephLogDir` in storageclass to reflect the same with daemon: ``` cephLogDir: "/var/log/mynewpath" ``` Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-08-24 07:15:30 +00:00
Humble Chirammal	5089a4ce5d	doc: correct some source code comments in rbd driver code Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-08-17 06:57:09 +00:00
Madhu Rajanna	5562e46d0f	rbd: Cleanup OMAP data for secondary image If the image is in a secondary state and its up+replaying means its an healthy secondary and the image is primary somewhere in the remote cluster and the local image is getting replayed. Delete the OMAP data generated as we cannot delete the secondary image. When the image on the primary cluster gets deleted/mirroring disabled, the image on all the remote (secondary) clusters will get auto-deleted. This helps in garbage collecting the OMAP, PVC and PV objects after failback operation. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-16 17:38:25 +00:00
Madhu Rajanna	fc0d6f6b8b	rbd: return succuss if image is healthy secondary If the image is in secondary state and its up+replaying means its an healthy secondary and the image is primary somewhere in the remote cluster and the local image is getting replayed. Return success for the Disabling mirroring as we cannot disable the mirroring on the secondary state, when the image on the remote site gets disabled the image on all the remote (secondary) will get auto deleted. This helps in garbage collecting the volume replication kuberentes artifacts Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-16 17:38:25 +00:00
Madhu Rajanna	35324b2e17	rbd: add helper function to get local state added helper function to check the local image state is up+replaying. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-16 17:38:25 +00:00
Humble Chirammal	87beaac25b	rbd: add ReadWriteOncePod in accessModeStrToInt() conversion function Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-08-12 09:55:50 +00:00
Rakshith R	f05ac2b25d	rbd: extract kmsID from volumeAttributes in RegenerateJournal() This commit adds functionality of extracting encryption kmsID, owner from volumeAttributes in RegenerateJournal() and adds utility functions ParseEncryptionOpts and FetchEncryptionKMSID. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-10 09:17:59 +00:00
Rakshith R	b960e3633a	rbd: extract volumeNamePrefix in RegenerateJournal() Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-10 09:17:59 +00:00
Rakshith R	b9b4b1e34e	rbd: refractor RegenerateJournal() to take in volumeAttributes This commit refractors RegenerateJournal() to take in volumeAttributes map[string]string as argument so it can extract required attributes internally. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-10 09:17:59 +00:00
Rakshith R	39d6752fc1	rbd: use `CSIInstanceID` var instead of "default" in RegenerateJournal() Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-10 09:17:59 +00:00
Madhu Rajanna	3c85219962	rbd: consider empty mirroring mode consider the empty mirroring mode when validating the snapshot interval and the scheduling time. Even if the mirroring Mode is not set validate the snapshot scheduling details as cephcsi sets the mirroring mode to default snapshot. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-09 11:05:05 +00:00
Rakshith R	825211730c	rbd: fix snapshot id idempotency issue This commit fixes snapshot id idempotency issue by always returning an error when flattening is in progress and not using `readyToUse:false` response. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-09 07:28:43 +00:00
Rakshith R	859d696279	cleanup: refractor checkCloneImage to reducing nesting if This commit refractors checkCloneImage function to address nestif linter issue. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-09 07:28:43 +00:00
Madhu Rajanna	a5a8952716	rbd: fix clone problem This commit fixes a bug in checkCloneImage() which was caused by checking cloned image before checking on temp-clone image snap in a subsequent request which lead to stale images. This was solved by checking temp-clone image snap and flattening temp-clone if needed. This commit also fixes comparison bug in flattenCloneImage(). Signed-off-by: Rakshith R <rar@redhat.com> Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-09 07:28:43 +00:00
Rakshith R	08728b631b	rbd: fix vol.VolID in cloneFromSnapshot() Volume generated from snap using genrateVolFromSnap already copies volume ID correctly, therefore removing `vol.VolID = rbdVol.VolID` which wrongly copies parent Volume ID instead leading to error from copyEncryption() on parent and clone volume ID being equal. Signed-off-by: Rakshith R <rar@redhat.com>	2021-08-09 07:28:43 +00:00
Madhu Rajanna	2782878ea2	rbd: log LastUpdate in UTC format This Commit converts the LastUpdate from int to the UTC format and logs it for better debugging. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-06 10:18:51 +00:00
Madhu Rajanna	92ad2ceec9	rbd: read clusterID and PoolID from mapping Whenever Ceph-CSI receives a CSI/Replication request it will first decode the volumeHandle and try to get the required OMAP details if it is not able to retrieve, receives a `Not Found` error message and Ceph-CSI will check for the clusterID mapping. If the old volumeID `0001-00013-site1-storage-0000000000000001 -b0285c97-a0ce-11eb-8c66-0242ac110002` contains the `site1-storage` as the clusterID, now Ceph-CSI will look for the corresponding clusterID `site2-storage` from the above configmap. If the clusterID mapping is found now Ceph-CSI will look for the poolID mapping ie mapping between `1` and `2`. Example:- pool with name exists on both the clusters with different ID's Replicapool with ID `1` on site1 and Replicapool with ID `2` on site2. After getting the required mapping Ceph-CSI has the required information to get more details from the rados OMAP. If we have multiple clusterID mapping it will loop through all the mapping and checks the corresponding pool to get the OMAP data. If the clusterID mapping does not exist Ceph-CSI will return an `Not Found` error message to the caller. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-08-05 16:07:51 +00:00
Madhu Rajanna	8f185bf7b2	rbd: use rados namespace for manager command Currently we have a bug that we are not using rados namespace when adding ceph manager command to remove the image from the trash. This commit adds the missing rados namespace when adding ceph manager task. without fix the image will be moved to trash and no task will be added to remove from the trash. it will become ceph responsibility to remove the image from trash when it will cleanup the trash. workaroud: manually purge the trash Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-07-28 03:48:33 +00:00
Niels de Vos	ec6703ed58	rbd: rename encryption metadata keys to enable mirroring RBD image metadata keys that start with '.rbd' are expected to be internal to RBD itself and are not mirrored to remote sites. Renaming the keys (dropping the '.' prefix) and using the new MigrateMetadata() function now makes the keys available on remote sites too. Closes: #2219 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-07-26 11:49:56 +00:00
Niels de Vos	607129171d	rbd: move image metadata key migration to its own function The new MigrateMetadata() function can be used to get the metadata of an image with a deprecated and new key. Renaming metadata keys can be done easily this way. A default value will be set in the image metadata when it is missing completely. But if the deprecated key was set, the data is stored under the new key and the deprecated key is removed. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-07-26 11:49:56 +00:00
Yati Padia	6691951453	rbd: use go-ceph for getImageMirroringStatus Currently, getImageMirroringStatus() is using RBD CLI. This commit converts RBD CLI to go-ceph API. Fixes: #2120 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-26 06:37:40 +00:00
Prasanna Kumar Kalever	526ff95f10	rbd: add support to expand encrypted volume Previously in ControllerExpandVolume() we had a check for encrypted volumes and we use to fail for all expand requests on an encrypted volume. Also for Block VolumeMode PVCs NodeExpandVolume used to be ignored/skipped. With these changes, we add support for the expansion of encrypted volumes. Also for raw Block VolumeMode PVCs with Encryption we call NodeExpandVolume. That said, With LUKS1, cryptsetup utility doesn't prompt for a passphrase on resizing the crypto mapper device. This is because LUKS1 devices don't use kernel keyring for volume keys. Whereas, LUKS2 devices use kernel keyring for volume key by default, i.e. cryptsetup utility asks for a passphrase if it detects volume key was previously passed to dm-crypt via kernel keyring service, we are overriding the default by --disable-keyring option during cryptsetup open command. So that at the time of crypto mapper device resize we will not be prompted for any passphrase. Fixes: #1469 Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-23 10:00:23 +00:00
Yati Padia	1ae2afe208	cleanup: modifies the error caused due to merged PRs This commit modifies the error of godot, cyclop, paralleltest linter caused due to merged PRs. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-22 18:15:48 +00:00
Rakshith R	43f753760b	cleanup: resolve nlreturn linter issues nlreturn linter requires a new line before return and branch statements except when the return is alone inside a statement group (such as an if statement) to increase code clarity. This commit addresses such issues. Updates: #1586 Signed-off-by: Rakshith R <rar@redhat.com>	2021-07-22 06:05:01 +00:00
Yati Padia	3469dfc753	cleanup: resolve errorlint issues This commit resolves errorlint issues which checks for the code that will cause problems with the error wrapping scheme. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-19 13:31:29 +00:00
Prasanna Kumar Kalever	78f740d903	rbd: improve healer to run multiple NodeStageVolume req concurrently This will bring down the healer run time by a great factor. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	b6a88dd728	rbd: add volume healer Problem: ------- For rbd nbd userspace mounter backends, after a restart of the nodeplugin all the mounts will start seeing IO errors. This is because, for rbd-nbd backends there will be a userspace mount daemon running per volume, post restart of the nodeplugin pod, there is no way to restore the daemons back to life. Solution: -------- The volume healer is a one-time activity that is triggered at the startup time of the rbd nodeplugin. It navigates through the list of volume attachments on the node and acts accordingly. For now, it is limited to nbd type storage only, but it is flexible and can be extended in the future for other backend types as needed. From a few feets above: This solves a severe problem for nbd backed csi volumes. The healer while going through the list of volume attachments on the node, if finds the volume is in attached state and is of type nbd, then it will attempt to fix the rbd-nbd volumes by sending a NodeStageVolume request with the required volume attributes like secrets, device name, image attributes, and etc.. which will finally help start the required rbd-nbd daemons in the nodeplugin csi-rbdplugin container. This will allow reattaching the backend images with the right nbd device, thus allowing the applications to perform IO without any interruptions even after a nodeplugin restart. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	6007fc9bfe	cleanup: move static volume check to helper function Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	6d24080851	rbd: update per volume metadata stash-file with devicePath As part of stage transaction if the mounter is of type nbd, then capture device path after a successful rbd-nbd map. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	70998571aa	cleanup: change variable name from path to metaDataPath path is used by standard package. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Humble Chirammal	61bf49a4f5	rbd: Get rid of locking at nodePublish Considering kubelet make sure the stage and publish operations are serialized, we dont need any extra locking in nodePublish Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-16 07:18:56 +00:00
Humble Chirammal	ef852cc93d	rbd: Get rid of locking at nodeUnpublish call Considering kubelet make sure the unstage and unpublish operations are serialized, we dont need any extra locking in nodeUnpublish Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-07-16 07:18:56 +00:00
Yati Padia	f36d611ef9	cleanup: resolves gofumpt issues of internal codes This PR runs gofumpt for internal folder. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-14 19:50:56 +00:00
Yati Padia	f210d5758b	cleanup: spell check getImageMirroingStatus This commit corrects the spelling for getImageMirroingStatus() -> getImageMirroringStatus Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-14 07:32:01 +00:00
Yati Padia	69c9e5ffb1	cleanup: resolve parallel test issue This commit resolves parallel test issues and also excludes internal/util/conn_pool_test.go as those test can't run in parallel. Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-13 11:31:39 +00:00
Yati Padia	4a649fe17f	cleanup: resolve godot linter This commit resolves godot linter issue which says "Comment should end in a period (godot)". Updates: #1586 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-13 06:50:03 +00:00
Yati Padia	ffab37f44f	cleanup: resolves gocritic linter issues This commit resolves gocritic linter errors. Updates: #2250 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-07-08 05:19:26 +00:00
Madhu Rajanna	dd0884310f	rbd: set image metadata in isThickProvisioned setting metadata in isThickProvisioned method helps us to avoid checking thick metakey and deprecated metakey for both thick and thin provisioned images and also this will easily help us to migrated the deprecated key to new key. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-07-07 08:31:10 +00:00
Madhu Rajanna	77135599ac	rbd: make setThickProvisioned as method of rbdImage isThickProvisioned is already method of the rbdImage to keep similar thick provisioner related functions common making isThickProvisioned as method of rbdImage. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-07-07 08:31:10 +00:00
Madhu Rajanna	708800ddc1	rbd: set thick metadata if ThickProvision is set instead of checking the parent is thick provisioned or not we can decide based on the rbdVol generated from the request. If the request is to create a Thick Image. set metadata without checking the parent. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-07-07 08:31:10 +00:00
Madhu Rajanna	332a47a100	rbd: deprecate .rbd.csi.ceph.com/thick-provisioned metadata key As image metadata key starting with '.rbd' will not be copied when we do clone or mirroring, deprecating the old key for the same reason use 'csi.ceph.com/thick-provisioned' to set image metadata. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-07-07 08:31:10 +00:00
Madhu Rajanna	0837c05be0	rbd: set scheduling interval on snapshot mirrored image Mirror-snapshots can also be automatically created on a periodic basis if mirror-snapshot schedules are defined. The mirror-snapshot can be scheduled globally, per-pool, or per-image levels. Multiple mirror-snapshot schedules can be defined at any level. To create a mirror-snapshot schedule with rbd, specify the mirror snapshot schedule add command along with an optional pool or image name; interval; and optional start time: The interval can be specified in days, hours, or minutes using d, h, m suffix respectively. The optional start-time can be specified using the ISO 8601 time format. For example: ``` $ rbd --cluster site-a mirror snapshot schedule add --pool image-pool --image image1 24h 14:00:00-05:00 ``` Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-07-06 14:41:48 +00:00
Rakshith R	1b23d78113	rebase: update kubernetes to v1.21.2 Updated kubernetes packages to latest release. resizefs package has been included into k8s.io/mount-utils package. updated code to use the same. Updates: #1968 Signed-off-by: Rakshith R <rar@redhat.com>	2021-07-01 03:35:23 +00:00
Humble Chirammal	8f82a30c21	internal: reformat long lines in internal/rbd package to 120 chars We have many declarations and invocations..etc with long lines which are very difficult to follow while doing code reading. This address the issues in below files, and restrict the line length to 120 chars. -internal/rbd/rbd_attach.go -internal/rbd/rbd_journal.go -internal/rbd/rbd_util.go -internal/rbd/replicationcontrollerserver.go -internal/rbd/snapshot.go Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-06-28 14:43:49 +00:00
Humble Chirammal	e829308249	internal: reformat long lines in internal/rbd package to 120 chars We have many declarations and invocations..etc with long lines which are very difficult to follow while doing code reading. This address the issues in 'internal/rbd/*server.go' and 'internal/rbd/driver.go' files to restrict the line length to 120 chars. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-06-28 14:43:49 +00:00
Rakshith R	404e011ae9	cleanup: added helper func isNotMountPoint Added helper func isNotMountPoint to check mountPoint, validate error and reduce complexity of NodeStageVolume. Signed-off-by: Rakshith R <rar@redhat.com>	2021-06-28 05:46:42 +00:00
Rakshith R	7fc553a3a7	rbd: removing TrimSpace from validateImageFeatures func `imageFeatures` string containing just whitespace should also be treated as a invalid feature. Signed-off-by: Rakshith R <rar@redhat.com>	2021-06-28 05:46:42 +00:00
Rakshith R	84b046d736	rbd: add check for imageFeatures parameter This commit adds checks for missing `imageFeatures` parameter in createvolumerequest and nodestagerequest(only for static PVs). Missing `imageFeatures` parameter is ignored in case of non-static PVs to ensure backwards compatibility with older versions which did not have `imageFeatures` as required parameter. Signed-off-by: Rakshith R <rar@redhat.com>	2021-06-28 05:46:42 +00:00
Yati Padia	13667c013c	cleanup: addresses paralleltest linter The Go linter paralleltest checks that the t.Parallel gets called for the test method and for the range of test cases within the test. Updates: #2025 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-06-25 11:55:12 +00:00
Niels de Vos	0ee0c12027	cleanup: prevent panic in cleanUpSnapshot While cleaning up snapshots, not all object may exist after a partial provisioning attempt. In case objects are missing, do not try to delete them. Fixes: #2192 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-25 10:01:35 +00:00
Niels de Vos	eeec4471c5	rbd: no need to create a snapshot on a thick-provisioned volume When cloning a volume from a (CSI) snapshot, we use DeepCopy() and do not need an RBD snapshot as source. Suggested-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-23 14:22:28 +00:00
Niels de Vos	d2c4cacb39	rbd: restart thick-provisioned PVC snapshot restoring after aborting In case restoring a snapshot of a thick-PVC failed during DeepCopy(), the image will exist, but have partial contents. Only when the image has the thick-provisioned metadata set, it has completed DeepCopy(). When the metadata is missing, the image is deleted, and an error is returned to the caller. Kubernetes will automatically retry provisioning on the ABORTED error, and the restoring will get restarted from the beginning. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-23 14:22:28 +00:00
Niels de Vos	7f1bdb49d1	rbd: use DeepCopy() when restoring a thick-snapshot Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-23 14:22:28 +00:00
Madhu Rajanna	591ba3f580	rbd: set thick provision metadata on clone volume the parent volume(CreateVolume) and the clone volume (CreateSnapshot) are both indepedent and parent volume can be deleted anytime. To check the thick provision during Snapshot restore(CreateVolume from snapshot) we need the thick provision metadata so for the same reason setting the thick provision metadata on the clone image we are creating at the CreateSnapshot time. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-18 10:57:48 +00:00
Madhu Rajanna	6d14eeee70	rbd: use RbdSnapName to check the image details RbdSnapName holds the actual RBD image name which got created during the CreateSnapshot operation. RbdImageName holds the name of the parent from which the snapshot is created. and the parent is independent of snapshot and it can be deleted any time for the same reason using the RbdSnapName to check the rbd image details. generate a temporary volume from the snapshot which replaces the rbdImageName with RbdSnapName and use it to check the image metadata. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-18 10:57:48 +00:00
Madhu Rajanna	7966d2e5c1	rbd: add validation for thick restore/clone added validation to allow only Restore of Thick PVC snapshot to a thick clone and creation of thick clone from thick PVC. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-18 10:57:48 +00:00
Madhu Rajanna	fc442221e4	rbd: make isThickProvisioned method of rbdImage isThickProvisioned can be used for both snapshot and clone validation if isThickProvisioned is method of common rbdImage structure. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-18 10:57:48 +00:00
Niels de Vos	57d3183cb1	rbd: restart thick-provisioned PVC cloning after aborting In case cloning a thick-PVC failed during DeepCopy(), the image will exist, but have partial contents. Only when the image has the thick-provisioned metadata set, it has completed DeepCopy(). When the metadata is missing, the image is deleted, and an error is returned to the caller. Kubernetes will automatically retry provisioning on the ABORTED error, and the cloning will get restarted from the beginning. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-18 06:25:56 +00:00
Niels de Vos	b1045364d9	rbd: disable FeatureDeepFlatten when doing DeepCopy() Not all Linux kernels support the deep-flatten feature. Disabling the feature makes it possible to map RBD images on older kernels (like what minikube uses). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-18 06:25:56 +00:00
Niels de Vos	4908ff8743	rbd: no need to flatten thick-provisioned images Thick-provisioned images are independent, cloned images or snapshots are deep-flattened during creation. There is no need to try and flatten them again. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-18 06:25:56 +00:00
Niels de Vos	6cc11c15d3	rbd: use DeepCopy to create a thick-provisioned clone To create a full-allocated RBD image from a snapshot/clone DeepCopy() can be used. This is needed when the parent of the new volume is thick-provisioner, so that the new volume is independent of the parent and thick-provisioned as well. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-18 06:25:56 +00:00
Niels de Vos	334f237e23	cleanup: move snapshot/clone/flatten into its own function Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-18 06:25:56 +00:00
Madhu Rajanna	367eb9f748	rbd: correct return error for isCompatibleEncryption isCompatibleEncryption is used to validate the requested volume and the existing volume and the destination volume name wont be generated yet and logging the destination volume prints the empty image name with pool name. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-17 10:12:18 +00:00
Madhu Rajanna	05b8433b89	rbd: check stdErr for does not have a parent error actual error will be present in the stdErr not the error when we try to add a task to flatten the rbd image. This commits corrects the error checking when the image does not have a parent. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-15 11:07:34 +00:00
Yati Padia	6bfdf2feb0	cleanup: gocyclo being unused for linter This commit addresses the following issue: 'nolint:gocyclo // complexity needs to be reduced.' is unused for linter "gocyclo" (nolintlint) Updates:#2025 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-06-15 02:54:16 +00:00
Yug	5c079894c7	doc: correct comment indentation in rbdVolume correct comment indentation in rbdvolume{} Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-15 02:34:51 +00:00
Yug	d992803e9e	rbd: Update pool name in image chain While traversing image chain, the parent image can be present in a different pool that the one child is in. So, updating pool name in the next itteration to that of the Parent. Co-authored-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-10 21:46:53 +00:00
Yug	1f6a9cabfd	rbd: verify if pool name is not empty Validate Snapshot request to check if the passed pool name is not empty. Co-authored-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-10 21:46:53 +00:00
Yug	3898ae34a7	rbd: open new ioctx connection if the parent and child clones are in different namespaces we need to open a new ioctx for pools. Co-authored-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-10 21:46:53 +00:00
Yug	b63b0bf18d	rbd: retrieve parent pool name of child image when clones are created in different pool,we need to retrieve the parent pool to get the information of the parent image. Co-authored-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-10 21:46:53 +00:00
Yug	e699318acc	rbd: pass parent volume to undoSnapshotCloning function as we are supporting the creation of clone to a new pool we need to pass the correct parent volume to cleanup the snapshot on parent volume. Co-authored-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-10 21:46:53 +00:00
Yug	961c1d12fd	rbd: add support to create clone in different pool added support to create image in different pool. if the snapshot/rbd image exists in one pool we can create a clone the clone of the rbd image to a different pool. Co-authored-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-10 21:46:53 +00:00
Mohammed Naser	671d6a7767	rbd: Backout if image features is empty In golang world, if you split an empty string that does not contain the seperator, you get an array with one empty string. This results in volumes failing to mount with "invalid feature " (note extra space because it's trying to check if 'empty string' is a valid feature). This patch checks if the string is empty, and if so, it just decides to skip the entire validation and returning nothing. Signed-off-by: Mohammed Naser <mnaser@vexxhost.com>	2021-06-10 15:43:09 +00:00
Mohammed Naser	f193ebfbb1	rbd: Add failing test when no features are provided Signed-off-by: Mohammed Naser <mnaser@vexxhost.com>	2021-06-10 15:43:09 +00:00
Madhu Rajanna	7b5c78ec7c	rbd: fail fast in create volume for missmatch encryption CreateVolume will fail in below cases * If the snapshot is encrypted and requested volume is not encrypted * If the snapshot is not encrypted and requested volume is encrypted * If the parent volume is encrypted and requested volume is not encrypted * If the parent volume is not encrypted and requested volume is encrypted Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-07 15:05:21 +00:00
Yati Padia	0f44c6acb7	cleanup: address wasted assign issues At places variable is reassigned without being used. Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-06-03 09:51:14 +00:00
Niels de Vos	7cbad9305f	rbd: repair thick-provisioned images on CreateVolume restart Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-01 14:42:12 +00:00
Niels de Vos	96a8ea3e88	cleanup: split repairExistingVolume() from CreateVolume() Move the repairing of a volume/snapshot from CreateVolume to its own function. This reduces the complexity of the code, and makes the procedure easier to understand. Further enhancements to repairing an exsiting volume can be done in the new function. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-06-01 14:42:12 +00:00
Madhu Rajanna	2e978e4211	rbd: fix typo in error message fixed typo in error message. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-06-01 10:40:07 +00:00
Rakshith R	b891e5585d	cleanup: address ifshort linter issues This commit addresses ifshort linter issues which checks if short syntax for if-statements is possible. updates: #1586 Signed-off-by: Rakshith R <rar@redhat.com>	2021-05-26 07:04:32 +00:00
Rakshith R	6618e2012d	cleanup: remove unnecessary calling of .String() when logging This commit removes calling of .String() when logging since `%s`,`%v` or `%q` will call an existing .String() function automatically. Fixes: #2051 Signed-off-by: Rakshith R <rar@redhat.com>	2021-05-25 18:02:11 +00:00
Yati Padia	774e8e4042	util: enable golang profiling Add support for golang profiling. Standard tools like go tool pprof and curl work. example: $ go tool pprof http://localhost:8080/debug/pprof/profile $ go tool pprof http://localhost:8080/debug/pprof/heap $ curl http://localhost:8080/debug/pprof/heap?debug=1 https://golang.org/pkg/net/http/pprof/ contains more details about the pprof interface. Fixes: #1699 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-05-25 10:41:22 +00:00
Niels de Vos	25d0a1cfc0	rbd: add support for block-devices in NodeGetVolumeStats() The NodeGetVolumeStats procedure can now be used to fetch the capacity of the RBD block-device. By default this is a thin-provisioned device, which means that the capacity is not reserved in the Ceph cluster. This makes it possible to over-provision the cluster. In order to detect the amount of storage used by the RBD block-device (when thin-provisioned), it is required to connect to the Ceph cluster. Unfortunately, the NodeGetVolumeStats CSI procedure does not provide enough parameters to connect to the Ceph cluster and fetch more details about the RBD image. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-05-25 06:41:04 +00:00
Madhu Rajanna	0ce6ad1152	rbd: fix image details logging log only the required details of the image. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-05-07 07:57:37 +00:00
Madhu Rajanna	67d73cd6e9	rbd: flatten image if the depth is not zero flatten the image if the deep-flatten feature is present on the images in the chain or if the images in chain is not zero, as we cannot check the deep-flatten feature the images which are in trash. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-05-07 07:57:37 +00:00
Madhu Rajanna	e15e2e5081	rbd: discard image not found error For flatten we call checkImageChainHasFeature which internally calls to getImageInfo returns the parent name even if the parent is in the trash, when we try to open the parent image to get its information it fails as the image not found. we should treat error as nil if the parent is not found. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-05-07 07:57:37 +00:00
Niels de Vos	f11a041f56	cleanup: address gosec complaint about creating a file The new gosec 2.7.0 complains like: G304 (CWE-22): Potential file inclusion via variable (Confidence: HIGH, Severity: MEDIUM) Updates: #2025 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-05-05 16:05:23 +00:00
Madhu Rajanna	07a916b84d	rbd: mark image ready when image state is up+unknown To recover from split brain (up+error) state the image need to be demoted and requested for resync on site-a and then the image on site-b should gets demoted.The volume should be marked to ready=true when the image state on both the clusters are up+unknown because during the last snapshot syncing the data gets copied first and then image state on the site-a changes to up+unknown. If the image state on both the sites are up+unknown consider that complete data is synced as the last snapshot gets exchanged between the clusters. * create 10 GB of file and validate the data after resync * Do Failover when the site-a goes down * Force promote the image and write data in GiB * Once the site-a comes back, Demote the image and issue resync * Demote the image on site-b * The status will get reflected on the other site when the last snapshot sync happens * The image will go to up+unknown state. and complete data will be copied to site a * Promote the image on site-a and use it ```bash csi-vol-5633715e-a7eb-11eb-bebb-0242ac110006: global_id: e7f9ec55-06ab-46cb-a1ae-784be75ed96d state: up+unknown description: remote image demoted service: a on minicluster1 last_update: 2021-04-28 07:11:56 peer_sites: name: e47e29f4-96e8-44ed-b6c6-edf15c5a91d6-rook-ceph state: up+unknown description: remote image demoted last_update: 2021-04-28 07:11:41 ``` * Do Failover when the site-a goes down * Force promote the image on site-b and write data in GiB * Demote the image on site-b * Once the site-a comes back, Demote the image on site-a * The images on the both site will go to split brain state ```bash csi-vol-37effcb5-a7f1-11eb-bebb-0242ac110006: global_id: 115c3df9-3d4f-4c04-93a7-531b82155ddf state: up+error description: split-brain service: a on minicluster2 last_update: 2021-04-28 07:25:41 peer_sites: name: abbda0f0-0117-4425-8cb2-deb4c853da47-rook-ceph state: up+error description: split-brain last_update: 2021-04-28 07:25:26 ``` * Issue resync * The images cannot be resynced because when we issue resync on site a the image on site-b was in demoted state * To recover from this state (promote and then demote the image on site-b after sometime) ```bash csi-vol-37effcb5-a7f1-11eb-bebb-0242ac110006: global_id: 115c3df9-3d4f-4c04-93a7-531b82155ddf state: up+unknown description: remote image demoted service: a on minicluster1 last_update: 2021-04-28 07:32:56 peer_sites: name: e47e29f4-96e8-44ed-b6c6-edf15c5a91d6-rook-ceph state: up+unknown description: remote image demoted last_update: 2021-04-28 07:32:41 ``` * Once the data is copied we can see that the image state is moved to up+unknown on both sites * Promote the image on site-a and use it Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-05-05 13:38:29 +00:00
Madhu Rajanna	c3bae17fce	rbd: delete encryption key from KMS when a Snapshot is encrypted during a CreateSnapshot operation, the encryption key gets created in the KMS when we delete the Snapshot the key from the KMS should also gets deleted. When we create a volume from snapshot we are copying required information but we missed to copy the encryption information, This commit adds the missing information to delete the encryption key. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-30 08:05:47 +00:00
Humble Chirammal	798437d0c4	rbd: return crypt error for the rpc return At present we return the volume connect error if the clone from snapshot fails when rbdvolume is encrypted, which is incorrect. This patch correctly return the failed copy encryption error to the caller Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-04-21 16:10:20 +00:00
Madhu Rajanna	52290333e6	rbd: modified logic to check image watchers Before RBD map operation, we do check the watchers on the RBD image. In the case of RWO volume. cephcsi makes sure only one client is using the RBD image. If the rbd image is mirrored, by default mirroring daemon will add a watcher on the image and as we are using go-ceph a watcher will be added as we have opened the image So we will have two watchers on an image if mirroring is enabled. This holds when the rbd mirror daemon is running, In case if the mirror daemon is not running there will be only one watcher on the rbd image (which is placed by go-ceph image open) we should not block the map operation if the mirroring daemon is not running as its Async mirroring. This commit adds a check to make sure no more than 2 watchers if the image is mirrored or no more than 1 watcher if it is not mirrored image. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-19 16:30:55 +00:00
Yug	6a46f381c2	cleanup: update description to generic Since rbdImage is a common struct for rbdVolume and rbdSnapshot, it description was matching to only snapshot. This commit makes the comments generic for both volumes and snapshots. Signed-off-by: Yug <yuggupta27@gmail.com>	2021-04-19 07:32:35 +00:00
Madhu Rajanna	eea52847bc	rbd: check volumeID in PV if image not found If the pool or few keys are missing in the omap. GetImageAttributes function returns nil error message and few empty items in imageAttributes struct. if the image is not found and the entiries are missing use the volumeId present on the PV annotation for further operations. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-15 17:13:06 +05:30
Madhu Rajanna	cfc88c9910	rbd: discard up+unknown state in ResyncVolume incase if the image is promoted and demoted the image state will be set to up+unknown if the image on the remote cluster is still in demoted state. when user changes the state from primary to secondary and still the image is in demoted (secondary) state in the remote cluster. the image state on both the cluster will be on unknown state. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-15 17:13:06 +05:30
Niels de Vos	8b8480017b	logging: report issues in rbdImage.DEKStore API with stacks It helps to get a stack trace when debugging issues. Certain things are considered bugs in the code (like missing attributes in a struct), and might cause a panic in certain occasions. In this case, a missing string will not panic, but the behaviour will also not be correct (DEKs getting encrypted, but unable to decrypt). Clearly logging this as a BUG is probably better than calling panic(). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	b1d05a1840	rbd: repair encryption config in case it is missing It is possible that when a provisioner restarts after a snapshot was cloned, but before the newly restored image had its encryption metadata set, the new image is not marked as encrypted. This will prevent attaching/mounting the image, as the encryption key will not be fetched, or is not available in the DEKStore. By actively repairing the encryption configuration when needed, this problem should be addressed. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	1482105309	cleanup: use buildCreateVolumeResponse() to simplify CreateVolume() buildCreateVolumeResponse() exists exactly for the need to create a csi.CreateVolumeResponse based on an rbdVolume. Calling this helper reduces the code duplication in CreateVolume(). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	52433841b4	cleanup: move copyEncryptionConfig() from CreateVolume to Exists() The rbdVolume that needs its encryption configured is constructed in the Exists() method. It is suitable to move the copyEncryptionConfig() call there as well, so that the object is completely constructed in a single place. Golang-ci:gocyclo complained about the increased complexity of the Exists() function. Moving the repairing of the ImageID into its own helper function makes the code a little easier to understand. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	596410ae60	cleanup: address "nolint" comments for RBD CreateSnapshot Introduce helper function cloneFromSnapshot() that takes care of the procedures that are needed when an existing snapshot has been found. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	b5d0524c39	cleanup: release resources for rbdImages objects after use Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	dc990037a5	rbd: move setupEncryption() from buildCreateVolumeResponse to CreateVolume Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	bea9d56117	rbd: copyEncryptionConfig in doSnapshotClone() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	fd5f4dbafd	rbd: configureEncryption() in genSnapFromSnapID() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	6fd3f57f40	rbd: set kmsID in reserveSnap() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	0a046c5b6d	rbd: copy encryption configuration in CreateSnapshot Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	6b1285d38b	rbd: copy passphrase for encrypted clones When a source volume is encrypted, the passphrase needs to be copied and stored for the newly cloned volume. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	7b332a0184	rbd: add rbdImage.copyEncryptionConfig() to copy encryption metadata Cloning volumes requires copying the DEK from the source to the newly cloned volume. Introduce copyEncryptionConfig() as a helper for that. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	b6aa19eea5	rbd: pass secrets when creating an source rbdVolume for cloning Without this, the rbdVolume can not connect to the Ceph cluster and configure the (optional) encryption. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	92b2e08adf	rbd: improve logging in deleteImage() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	99da92cfd7	rbd: move deletion of DEK to deleteImage() The ControllerServer should not need to care about support for encryption, ideally it is transparantly handled by the rbdVolume type and its internal API. Deleting the DEK was one of the last remainders that was explicitly done inside the ControllerServer. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Niels de Vos	a7c261a394	logging: correct formatting when reporting error in createVolumeFromSnapshot() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-14 03:59:28 +00:00
Madhu Rajanna	0f8813d89f	rbd:store/Read volumeID in/from PV annotation In the case of the Async DR, the volumeID will not be the same if the clusterID or the PoolID is different, With Earlier implementation, it is expected that the new volumeID mapping is stored in the rados omap pool. In the case of the ControllerExpand or the DeleteVolume Request, the only volumeID will be sent it's not possible to find the corresponding poolID in the new cluster. With This Change, it works as below The csi-rbdplugin-controller will watch for the PV objects, when there are any PV objects created it will check the omap already exists, If the omap doesn't exist it will generate the new volumeID and it checks for the volumeID mapping entry in the PV annotation, if the mapping does not exist, it will add the new entry to the PV annotation. The cephcsi will check for the PV annotations if the omap does not exist if the mapping exists in the PV annotation, it will use the new volumeID for further operations. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-07 11:46:27 +00:00
Rakshith R	020cded581	cleanup: refactor deeply nested if statements in internal/rbd Refactored deeply nested if statement in internal/rbd to reduce cognitive complexity. Signed-off-by: Rakshith R <rar@redhat.com>	2021-04-07 02:31:41 +00:00
Niels de Vos	aaeb35eceb	rbd: encrypted volumes can be of type "crypto_LUKS" too It seems that newer versions of some tools/libraries identify encrypted filesystems with `crypto_LUKS` instead of `crypt`. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-04-06 15:54:27 +00:00
Madhu Rajanna	d7838defcf	rbd: return FailedPrecondition error message In case of the DR the image on the primary site cannot be demoted as the cluster is down, during failover the image need to be force promoted. RBD returns `Device or resource busy` error message if the image cannot be promoted for above reason. Return FailedPrecondition so that replication operator can send request to force promote the image. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-06 14:12:41 +00:00
Madhu Rajanna	403532c9a6	rbd: use force from PromoteVolume Request instead of fetching the force option from the parameters. Use the Force field available in the PromoteVolume Request. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-06 14:12:41 +00:00
Madhu Rajanna	385a751b8e	rebase: rename kube-storage to csi-addons as the org github.com/kube-storage is renamed to github.com/csi-addons as the name kube-storage was more generic. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-06 10:59:58 +00:00
Humble Chirammal	314fe0e23d	cleanup: correct misspelling in rbd/clone.go Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-04-05 09:34:09 +00:00
Madhu Rajanna	448be70682	rbd: early check for disabled,disabling in DisableVolumeReplication added early check for disabling and disabled image mirroring state in DisableVolumeReplication Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-05 08:53:40 +00:00
Madhu Rajanna	fb3f7fe202	rbd: remove todo for image not found Incase of resync the image will get deleted, gets recreated and its a a time consuming operation. It makes sense to return aborted error instead of not found as we have omap data only the image is missing in rbd pool. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-05 08:53:40 +00:00
Madhu Rajanna	95387c3b5e	rbd: check for peer site status Do resync if the image is in unknow or in error state. Check for the current image state for up+stopped or up+replaying and also all peer site status should be un up+stopped to confirm that resyncing is done and image can be promoted and used. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-05 08:53:40 +00:00
Madhu Rajanna	233954bc10	rbd: make replication operations as rbdImage methods added replication related operations as a method of rbdImage as these methods can be easily used when we introduce volumesnaphot mirroring operations. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-05 08:53:40 +00:00
Madhu Rajanna	c822ad460d	rbd: add a check for image mirror disabling state the rbd mirror state can be in enabled,disabled or disabling state. If the mirroring is not disabled yet and still in disabling state. we need to check for it and return abort error message if the mirroring is still getting disabled. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-05 08:53:40 +00:00
Madhu Rajanna	aaf6b571b8	rbd: Add ReplicationServer struct for replication operations added ReplicationServer struct for the replication related operation it also embed the ControllerServer which already implements the helper functions like locking/unlocking etc. removed getVolumeFromID and cleanup functions for better code readability and easy maintaince. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-04-05 08:53:40 +00:00
Niels de Vos	eea97ca014	util: move GetID() from EncryptionKMS to VolumeEncryption There is no need for each EncryptionKMS to implement the same GetID() function. We have a VolumeEncryption type that is more suitable for keeping track of the KMS-ID that was used to get the configuration of the KMS. This does not change any metadata that is stored anywhere. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-24 12:09:04 +00:00
Madhu Rajanna	d8f7b38d3d	rbd: add exclusive-lock and journaling image features for rbd image Current rbd plugin only supports the layering feature for rbd image. Add exclusive-lock and journaling image features for the rbd. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: woohhan <woohyung_han@tmax.co.kr>	2021-03-24 09:48:04 +00:00
Niels de Vos	bbd24e52f3	cleanup: use rbdImage.Destroy() for temporary volumes rbdVolumes can have several resources that get allocated during its usage. Only destroying the IOContext may not be suffiecient and can cause resource leaks. Use rbdVolume.Destroy() when the rbdVolume is not used anymore. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-17 07:50:09 +00:00
Niels de Vos	5c26fbb0d7	util: use ClusterConnection.Copy() for re-using connections Connections are reference counted, so just assigning the connection to an other object for re-use is not correct. This can cause connections to be garbage collected while something else is still using it. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-17 07:50:09 +00:00
Madhu Rajanna	6e941539b5	rbd: implement volume replication spec implemented the volume replication spec for the rbd mirroring. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-16 13:06:44 +00:00
Madhu Rajanna	ee9a200fcc	rbd: implement mirroring helpers with go-ceph mirror.go exposes the helper functions to perform the mirroring operations. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-16 13:06:44 +00:00
Madhu Rajanna	c642637cec	util: register replication controller as RBD is implementing the replication we are registering it. For CephFS, its not implementing the replication we are passing nil so we dont want to register it. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-16 13:06:44 +00:00
Madhu Rajanna	ee0576278f	cleanup: move servers to a new struct For future enhancments like adding more servers. Moving the list of servers to a new structure. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-16 13:06:44 +00:00
Niels de Vos	10a75dd4ff	rbd: introduce rbdImage as base for rbdVolume and rbdSnapshot Because rbdVolume and rbdSnapshot are very similar, they can be based off a common struct rbdImage that contains the common attributes and functions. This makes it possible to re-use functions for snapshots, and prevents further duplication or code. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-15 08:10:51 +00:00
Niels de Vos	3fe714c4fa	cleanup: rename rbdSnapshot.SnapID to VolID The rbdSnapshot and rbdVolume structs have many common attributes. In order to combine these into an rbdImage struct that implements shared functionality, having the same attribute for the ID makes things much easier. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-15 08:10:51 +00:00
Niels de Vos	cffec0b3f3	rbd: configure the DEKStore if the configuration suggests to use metadata NewVolumeEncryption() will return an indication that an alternative DEKStore needs to be configured in case the KMS does not support it. setKMS() will also set the DEKStore if needed, so renaming it to configureEncryption() makes things clearer. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-12 10:11:47 +00:00
Niels de Vos	e4431edaf9	rbd: implement the DEKStore interface To accommodate storing DEKs outside a KMS, the DEK can be stored in the metadata of the volume. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-12 10:11:47 +00:00
Niels de Vos	9ac7f56400	util: move existing KMS implementations to the DEKStore interface Use DEKStore API for Fetching and Storing passphrases. Drop the fallback for the old KMS interface that is now provided as DEKStore. The original implementation has been re-used for the DEKStore interface. This also moves GetCryptoPassphrase/StoreNewCryptoPassphrase functions to methods of VolumeEncryption. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-12 10:11:47 +00:00
Niels de Vos	d4076d6216	util: introduce VolumeEncryption type Prepare for grouping encryption related functions together. The main rbdVolume object should not be cluttered with KMS or DEK procedures. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-12 10:11:47 +00:00
Madhu Rajanna	cbb10fd84d	rbd: add more logging for NodeUnstageVolume For NodeUnstageVolume its a two step process, first unmount the volume and than unmap the volume. Currently, we are logging only after rbd unmapping is done. sometimes it becomes difficult to debug with above logging whether more time is spent in unmount or unmap. This commits adds one more debug log after unmount is done. with this we can identify where exactly more time is spent by looking at the logs. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-11 17:40:57 +00:00
Niels de Vos	fe0f169875	rbd: write max 1gb per WriteSame() operation It seems that writing more than 1 GiB per WriteSame() operation causes an EINVAL (22) "Invalid argument" error. Splitting the writes in blocks of maximum 1 GiB should prevent that from happening. Not all volumes are of a size that is the multiple of the stripe-size. WriteSame() needs to write full blocks of data, so in case there is a small left-over, it will be filled with WriteAt(). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-11 10:57:31 +00:00
Niels de Vos	165a837bca	rbd: move KMS initialization into rbdVol.initKMS() Introduce initKMS() as a function of rbdVolume. KMS functionality does not need to pollute general RBD image functions. Encryption functions are now in internal/rbd.encryption.go, so move initKMS() there as well. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-24 13:16:11 +00:00
Niels de Vos	cf6dae86e9	rbd: move encryptDevice() to a method of rbdVolume Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-24 13:16:11 +00:00
Niels de Vos	fb065b0f39	rbd: move openEncryptedDevice() to a method of rbdVolume Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-24 13:16:11 +00:00
Niels de Vos	b5020657e6	rbd: add "--options notrim" when mapping a thick-provisioned image Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-19 11:55:40 +00:00
Niels de Vos	cc96bdaac3	rbd: allocate extents when expanding an image When and RBD image is expanded, the additional extents need to get allocated when the image was thick-provisioned. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-19 11:55:40 +00:00
Niels de Vos	294a0973bd	rbd: mark images thick-provisioned in metadata When images get resized/expanded, the additional space needs to be allocated if the image was initially thick-provisioned. By marking the image with a "thick-provisioned" key in the metadata, future operations can check the need. A missing "thick-provisioned" key indicates that the image has not been thick-provisioned. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-19 11:55:40 +00:00
Niels de Vos	74d218df8d	rbd: disable rbd_discard_on_zeroed_write_same for thick-allocation Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-19 11:55:40 +00:00
Niels de Vos	5522a05f59	rbd: thick-provision images on request Write blocks of stripe-size to allocate RBD images when Thick-Provisioning is enabled in the StorageClass. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-19 11:55:40 +00:00
Madhu Rajanna	c417a5d0ba	rbd: add support for thick provisioning option Add an option to the StorageClass to support creating fully allocated (thick provisioned) RBD images Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-19 11:55:40 +00:00
Niels de Vos	4937e59c4d	rbd: add backwards compatible encryption in NodeStageVolume When a volume was provisioned by an old Ceph-CSI provisioner, the metadata of the RBD image will contain `requiresEncryption` to indicate a passphrase needs to be created. New Ceph-CSI provisioners create the passphrase in the CreateVolume request, and set `encryptionPrepared` instead. When a new node-plugin detects that `requiresEncryption` is set in the RBD image metadata, it will fallback to the old behaviour. In case `encryptionPrepared` is read from the RBD image metadata, the passphrase is used to cryptsetup/format the image. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-17 17:51:13 +00:00
Niels de Vos	ee79b22c97	rbd: move encryption function to encryption.go This adds internal/rbd/encryption.go which will be used to include other encryption functionality to support additional KMS related functions. It will work together with the shared API from internal/util/kms.go. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-17 17:51:13 +00:00
Niels de Vos	9b6c2117f3	rbd: set encryption passphrase on CreateVolume Have the provisioner create the passphrase for the volume, instead of doign it lazily at the time the volume is used for the 1st time. This prevents potential races where pods on different nodes try to store different passphrases at the (almost) same time. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-17 17:51:13 +00:00
Niels de Vos	d534ee9ce8	rbd: include rados-namespace when calling addRbdManagerTask() It seems that calls to addRbdManagerTask() do not include the rados-namespace in the image location. Functions calling addRbdManagerTask() construct the image location themselves, but should use rbdVolume.String() to include all the attributes. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-12 12:02:14 +00:00
Niels de Vos	8d0b39e690	rbd: log error when scheduling flattening fails Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-02-12 12:02:14 +00:00
Niels de Vos	0b7521162c	cleanup: rewrite ifElseChains to switch statements Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-01-27 13:03:56 +00:00
Ilya Dryomov	04644c1d58	rbd: enable mapping and unmapping from a network namespace Make rbdplugin pod work in a non-initial network namespace (i.e. with "hostNetwork: false") by skipping waiting for udev events when mapping and unmapping images. CSI use case is very simple: all that is needed is a device node which is immediately fed to mkfs, so we should be able to tolerate udev not being finished with the device just fine. Fixes: #1323 Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2021-01-07 15:34:05 +00:00
Ilya Dryomov	c2493686b7	rbd: introduce appendDeviceTypeAndOptions() Factor out --device-type and --options formatting. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2021-01-07 15:34:05 +00:00
Ilya Dryomov	d3f31187fc	rbd: rename ndbType parameter Fix "ndb" typo. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2021-01-07 15:34:05 +00:00
Ilya Dryomov	5631b83dd0	rbd: rename mapOptions and options argument slices With the new support for passing --options, referring to ExecCommand() argument slices as mapOptions and options is confusing. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2021-01-07 15:34:05 +00:00
Seena Fallah	fdec9f65b8	rbd: fix namespace json parser for xbdDeviceInfo rbd device list --format=json returns namespace as a namespace not radosNamespace Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2021-01-05 11:26:09 +00:00
Madhu Rajanna	9c7176dbb4	rbd: update mount packges in import path mount packges is moved from k8s.io/utils/mount to a new repository k8s.io/mount-utils. updated code to use the same. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-17 16:04:54 +00:00
Niels de Vos	8f91c672d4	util: add EncryptionKMS.Destroy() Add a new method to the EncryptionKMS interface so that resources can be freed when EncryptionKMS instances get freed. With the move to using the libopenstorage API, a temporary file needs to store the optional CA certificate. The Destroy() method of the vaultConnection type now removes this file. The rbdVolume uses the EncryptionKMS type now, so call the new Destroy() method from withing rbdVolume.Destroy(). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-14 14:45:09 +00:00
Niels de Vos	f08182e2fc	rbd: pass Owner to GetKMS() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-14 14:45:09 +00:00
Niels de Vos	16cb43f0f9	rbd: store csi.storage.k8s.io/pvc/namespace metadata as Owner The Owner of an RBD image (Kubernetes Namespace, tenant) can be used to identify additional configuration options. This will be used for fetching the right Vault Token when encrypting/decrypting volumes. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 13:58:48 +00:00
Niels de Vos	f8ebc6aa3f	cleanup: return error type in ensureEncryptionMetadataSet() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	d8e443ab49	cleanup: return error type in cleanupRBDImageMetadataStash() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	f262673b60	cleanup: return error type in lookupRBDImageMetadataStash() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	8e589587ae	cleanup: return error type in stashRBDImageMetadata() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	57ce07f54e	cleanup: return error type in updateVolWithImageInfo() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	610162b5f4	cleanup: return error type in genVolFromVolumeOptions() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	729e2419ef	cleanup: return error type in detachRBDImageOrDeviceSpec() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	7eae69f10c	cleanup: return error type in rbdGetDeviceList() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	4dde3fc9e0	cleanup: return error type in encryptDevice() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	d6fb8f302d	cleanup: return error type in NodeServer.processEncryptedDevice() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	8019e4d1bc	rbd: return CSI status-error on resize failure Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	65a10fd553	cleanup: standardize error format in NodeServer.NodeStageVolume() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Niels de Vos	cc3f146ad1	cleanup: return error type in rbdVolume.checkCloneImage() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 08:35:35 +00:00
Madhu Rajanna	43fde0a30a	cleanup: add a helper function storeImageID added a helper function storeImageID to reduce code duplication. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-07 11:03:27 +00:00
Madhu Rajanna	c40872df00	rbd: undo reservation incase of errors If cephcsi encounters any error after reservation, as a cleanup operation it should revert back the reservation. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-07 11:03:27 +00:00
Madhu Rajanna	99dbe27921	rbd: return nil if the omap data exists If the omap data already exits return nil. so that omap generator will not try to reserve anything again. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-07 11:03:27 +00:00
Madhu Rajanna	8ebb9a1ba0	cleanup: fix misspell words fixed misspell words detected by codespell Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-30 08:46:48 +01:00
Madhu Rajanna	39b1f2b4d3	cleanup: fix mispell words fixed mispell words in the repo. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-29 12:47:46 +05:30
Madhu Rajanna	6091490393	rbd: improve logging in getCloneDepth earlier if the depth check fails the complete vol struct was getting logged, this commits logs only the pool and image name. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Madhu Rajanna	b3120926b9	rbd: remove extra Destory of parent volume removed extra Destory of the parent volume. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Madhu Rajanna	68bd44beba	rbd: add new controller to regenerate omap data In the case of Disaster Recovery failover, the user expected to create the static PVC's. We have planned not to go with the PVC name and namespace for many reasons (as in kubernetes it's planned to support PVC transfer to a new namespace with a different name and with new features coming in like data populator etc). For now, we are planning to go with static PVC's to support async mirroring. During Async mirroring only the RBD images are mirrored to the secondary site, and when the user creates the static PVC's on the failover we need to regenerate the omap data. The volumeHandler in PV spec is an encoded string which contains clusterID and poolID and image UUID, The clusterID and poolID won't remain same on both the clusters, for that cephcsi need to generate the new volume handler and its to create a mapping between new volume handler and old volume handler with that whenever cephcsi gets csi requests it check if the mapping exists it will pull the new volume handler and continues other operations. The new controller watches for the PVs created, It checks if the omap exists if it doesn't it will regenerate the entire omap data. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Madhu Rajanna	14700b89d1	rbd: update inuse logic of a rbd image in case of mirrored image, if the image is primary a watcher will be added by the rbd mirror deamon on the rbd image. we have to consider 2 watcher to check image is in use. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Madhu Rajanna	ba84f14241	journal: create object with provided UUID incase of async mirroring the volume UUID is retrieved from the volume name, instead of cephcsi generating a new UUID it should reserve the passed UUID it will be useful when we support both metro DR and async mirroring on a kubernetes clusters. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Madhu Rajanna	8d3a44d0c4	rbd: add minsnapshotsonimage flag An rbd image can have a maximum number of snapshots defined by maxsnapshotsonimage On the limit is reached the cephcsi will start flattening the older snapshots and returns the ABORT error message, The Request comes after this as to wait till all the images are flattened (this will increase the PVC creation time. Instead of waiting till the maximum snapshots on an RBD image, we can have a soft limit, once the limit reached cephcsi will start flattening the task to break the chain. With this PVC creation time will only be affected when the hard limit (minsnapshotsonimage) reached. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-18 05:59:20 +00:00
Mudit Agarwal	0ecfd0e72c	rbd: replace go-ceph GetParentInfo() with GetParent() GetParent() is a newer and better version of GetParentInfo() in go-ceph. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-11-03 08:00:12 +00:00
Humble Chirammal	25617929f0	rbd: use different variable instead of builtin `cap` function `cap` builtin function returns the capacity of a type. Its not good practice to use this builtin function for other variable names, removing it here Ref# https://golang.org/pkg/builtin/#cap Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-21 12:30:18 +00:00
Humble Chirammal	70358c8eb7	rbd: volJournal.Connect() return wrongly pushed to caller volJournal.Connect() got the error on err2 variable, however the return was on variable err which hold the error return of DecomposeCSIID() which is wrong. This cause the error return wrongly parsed and pushed from the caller. From now on, we are reusing the err variable to hold and revert the error of volJournal.Connect(). Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-20 12:45:51 +00:00
Humble Chirammal	5c73f0e41b	rbd: correct the code comment for ErrFlattenInProgress Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-20 08:59:25 +00:00
Humble Chirammal	75c6cb6544	rbd: remove unused credentials argument from StoreImageID() Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-15 06:46:41 +00:00
Humble Chirammal	4cb7d2bbbd	rbd: remove unwanted condition check in flattenCloneImage All the previous condition checks exit from the function and when it reach to this block its obvious that error is non nil, we dont need an extra check here. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-15 06:20:47 +00:00
Humble Chirammal	be9195cf32	rbd: remove false error condition check in checkSnapCloneExists Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-13 12:56:51 +00:00
Humble Chirammal	7992033156	rbd: remove false error condition check in rbdVol.Exists() Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-13 12:56:51 +00:00
Humble Chirammal	c9b7b6561a	rbd: remove false error condition check in genVolFromVolID The error check condition in genVolFromID() is always false as far as code reading/workflow goes. Removing it for the same reason. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-13 12:56:51 +00:00
Humble Chirammal	364b3afcaa	rbd: correct typo in validateRequestedVolumeSize() Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-13 08:58:05 +00:00
Marton Natko	91c49c14b3	rbd: fix problem with deletion of a namespaced image fix problem with deletion of a namespaced image Signed-off-by: Marton Natko <mnatko@transip.nl> Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-09-25 15:35:22 +00:00
Marton Natko	2fbca0d1e2	rbd: fix typo in createVolume validation Signed-off-by: Marton Natko <mnatko@transip.nl> Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-09-25 15:35:22 +00:00
Mudit Agarwal	e1237f348f	rbd: replace cli with go-ceph for snapshot namespace cephcsi uses cli for fetching snap list as well as to check the snapshot namespace, replaced that with go-ceph calls. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-09-25 08:07:32 +00:00
Madhu Rajanna	d1f175d9f3	rbd: add support for rbd map and unmap options added support for providing map and unmap options to rbd CLI when mapping rbd image on the node. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-09-21 13:27:28 +00:00
Madhu Rajanna	2deabcd284	rbd: allow readonly only for volume with contentsource It doesnot make sense to allow the creation of empty volumes which is going to be accessed with readonly mode, this commit allows the creation of volume which is having readonly capabilities only if the content source is set for the volume. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-09-10 13:47:23 +00:00
Mudit Agarwal	4de1abad5e	rbd: NodeExpandVolume() should use StagingTargetPath Form kubernetes v1.19 onwards NodeRequest is getting volume path in StagingTargetPath instead of VolumePath, cephcsi should also use the same. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-08-25 15:58:44 +00:00
Mudit Agarwal	8c2bdbc001	rbd: implement rbd.updateVolWithImageInfo() with go-ceph updateVolWithImageInfo() is currently executing an rbd command to get details about an RBD image. Replaced it with the required go-ceph functions. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-08-25 07:06:40 +00:00
Madhu Rajanna	38892c3068	rbd: rearrange the cleanup order incase clone fails in case of clone failure, we need to first delete the clone and the snapshot from which we created the clone, then as part of cleanup we need to remove the temporary cloned image and the temporary snapshot created on the parent image. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-24 08:16:59 +00:00
Madhu Rajanna	a7b65087af	rbd: fix return error message incase of flattening error return the flatten error. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-24 08:16:59 +00:00
Madhu Rajanna	7c9fee17af	rbd: replace klog with util logger in snapshot.go rbd: replace klog with util logger in snapshot.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	9305e27797	rbd: replace klog with util logger in rbd_util.go rbd: replace klog with util logger in rbd_util.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	90cc5a4fd0	rbd: replace klog with util logger in rbd_journal.go replace klog with util logger in rbd_journal.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	9512aa6206	rbd: replace klog with util logger in rbd_attach.go replace klog with util logger in rbd_attach.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	e768c0dfc0	rbd: replace klog with util logger in nodeserver.go replace klog with util logger in nodeserver.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	883485b082	rbd: replace klog with util logger in driver.go replace klog with util logger in driver.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	c4f30c23c1	rbd: replace klog with util logger in clone.go replace klog with util logger in clone.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Madhu Rajanna	d8ca56fbe9	rbd: replace klog with util in controllerserver.go replace klog with util in controllerserver.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 14:38:12 +00:00
Humble Chirammal	1f5b84745f	rbd: Bail out from nodeexpansion if its block mode pvc At CSI spec < 1.2.0, there was no volumecapability in the expand request. However its available from v1.2+ which allows us to declare the node operations based on the volume mode. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-08-19 12:34:20 +00:00
Madhu Rajanna	d4186a6880	rbd: discard flattening error if the image doesnt have parent if the image is created without flattening image-feature the image will get few image-features by default, deep-flatten is one of them. if the image doesnot have any parent the rbd image flattening will fail, This commit discards error message if the image doesnt have any parent. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-13 08:37:53 +00:00
Madhu Rajanna	8a581ec6c0	rbd: fix build issue for rbd merging of https://github.com/ceph/ceph-csi/pull/1035 broken the cephcsi building. This commits fixes the build issue. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-12 19:53:48 +05:30
Mehdy Khoshnoody	fc5eadf106	rbd: Add rados namespace support for rbd Make sure to operate within the namespace if any given when dealing with rbd images and snapshots and their journals. Signed-off-by: Mehdy Khoshnoody <mehdy.khoshnoody@gmail.com>	2020-08-12 16:22:58 +05:30
Madhu Rajanna	bfde065f92	util: move getMonsAndClusterID to util as we had duplicate functions in both cephfs and rbd this commit moves the function to util. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-11 10:02:53 +00:00
Madhu Rajanna	18f4e9d519	util: move csiconfigfile to util as we have csiconfigfile in both cephfs and rbd moving the configfile path to util folder. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-11 10:02:53 +00:00
Madhu Rajanna	2458ec6573	rbd: return error if fetching cluster id fails if we are not able to fetch the cluster-ID from the createSnapshot request and also if we are not able to get the monitor information from the cluster-ID return error instead of using the parent image information. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-03 14:25:06 +00:00
Mudit Agarwal	9ed0811422	rbd: implement rbdVolume.resize() with go-ceph Replaced command execution with go-ceph Resize() function. Volsize is being updated before waiting for resize() to return, fixed it to get updated only after resize() is successful. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-08-03 10:50:01 +00:00
Niels de Vos	be9e7cf956	rbd: pass context.Context to rbdVolume.resize() While adding the context.Context to the resizeRBDimage() function, it became a little ugly. So renaming the function to resize() and making it a method of the rbdVolume type. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 16:04:13 +00:00
Niels de Vos	36469b87e2	util: make ExecComand return stdout and stderr as string Most consumers of util.ExecCommand() need to convert the returned []byte format of stdout and/or stderr to string. By having util.ExecCommand() return strings instead, the code gets a little simpler. A few commands return JSON that needs to be parsed. These commands will be replaced by go-ceph implementations later on. For now, convert the strings back to []byte when needed. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 16:04:13 +00:00
Niels de Vos	ddac66d76b	util: use context.Context for logging in ExecCommand All calls to util.ExecCommand() now pass the context.Context. In some cases this is not possible or needed, and util.ExecCommand() will not log the command. This should make debugging easier when command executions fail. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 16:04:13 +00:00
Niels de Vos	bb4f1c7c9d	rbd: use util.ExecCommand() instead of execCommand() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 16:04:13 +00:00
Niels de Vos	47d5b60af8	rbd: disable reflink while creating XFS filesystems Current versions of the mkfs.xfs binary enable reflink support by default. This causes problems on systems where the kernel does not support this feature. When the kernel the feature does not support, but the filesystem has it enabled, the following error is logged in `dmesg`: XFS: Superblock has unknown read-only compatible features (0x4) enabled Introduce a check to see if mkfs.xfs supports the `-m reflink=` option. In case it does, pass `-m reflink=0` while creating the filesystem. The check is executed once during the first XFS filesystem creation. The result of the check is cached until the nodeserver restarts. Fixes: #966 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 13:37:51 +00:00
Niels de Vos	526da43b6a	rbd: remove unused rbdStatus() Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 11:34:48 +00:00
Niels de Vos	7afaac9c66	rbd: implement rbdVolume.isInUse() with go-ceph The new rbdVolume.isInUse() method will replace the rbdStatus() function. This removes one more rbd command execution in the DeleteVolume path. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-24 11:34:48 +00:00
Sven Anderson	92884f56f4	rbd: simplify error handling This change replaces the sentinel errors in rbd module with standard errors created with errors.New(). Related: #1203 Signed-off-by: Sven Anderson <sven@redhat.com>	2020-07-23 11:16:40 +00:00
Sven Anderson	8393fbe40b	util: simplify error handling The sentinel error code had additional fields in the errors, that are used nowhere. This leads to unneccesarily complicated code. This change replaces the sentinel errors in utils with standard errors created with errors.New() and adds a simple JoinErrors() function to be able to combine sentinel errors from different code tiers. Related: #1203 Signed-off-by: Sven Anderson <sven@redhat.com>	2020-07-23 11:16:40 +00:00
Madhu Rajanna	b3a4f510e6	rbd: take operation locks before operating on resource Take operation locks on the resources before operating on the resouces. This allows us to do parallel operations for some RPC calls such as Clone and Restore of PVC. This operations will only be blocked if the image is expanding or Snapshot and RBD image is getting deleted. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-21 13:25:19 +00:00
Yug	71ddf51544	cleanup: address gomnd warnings Direct usage of numbers should be avoided. Issue reported: mnd: Magic number: X, in <argument> detected (gomnd) Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Yug	48fa43270f	cleanup: address gocritic warnings Add explanation to nolint directives. Issue reported: whyNoLint: include an explanation for nolint directive (gocritic) Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Yug	7f94a57908	cleanup: address godot warnings Top level comments should end in a period Signed-off-by: Yug <yuggupta27@gmail.com>	2020-07-21 08:36:24 +00:00
Madhu Rajanna	11a6f6c1dd	rbd: Support data-pool when cloning rbd image Added support to clone an image in data-pool during CreateVolume RPC call. updates #1188 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-20 17:29:48 +00:00
Madhu Rajanna	684cb13c54	rbd: DisAllow CreateVoulume for missmatch volume size If the requested volume size and the snapshot or the parent volume from which the clone is to be created is not equal cephcsi returns an error message. updates #1188 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-16 12:33:27 +00:00
Madhu Rajanna	5208c0fc38	cleanup: replace klog with v2 This commit replaces the klog with klog/v2 in leftover place. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-16 04:10:58 +00:00
Madhu Rajanna	2c67ba1ec4	rbd: Return current depth if the image is not found If the image in the chain is moved to trash, we cannot get the image details. We need to return the found depth to the caller. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-15 18:40:45 +00:00
Madhu Rajanna	76c2f3c109	cleanup: re-use flattenTemporaryClonedImages to reduce duplicate code re-use flattenTemporaryClonedImages to avoid code duplication Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-15 16:10:38 +00:00
Madhu Rajanna	8fc9146056	rbd: flatten temp cloned images If the snapshots on the parent image exceeds maxSnapshotsOnImage count, we need to flatten all the temporary cloned images to over come the krbd issue of maximum number of snapshots on an image. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-15 16:10:38 +00:00
Madhu Rajanna	2fe1ee5287	rbd: create temporary snapshot with name same as temporary clone create temporary snapshot on the parent image same as name as the temporary clone rbd image. Naming the snapshot and the temporary cloned image helps to flatten the temporary cloned images when the snapshots on the parent image exceeds the configured maxSnapshotsOnImage. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-15 16:10:38 +00:00

... 3 4 5 6 7 ...

514 Commits