ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-04-11 01:53:00 +00:00

Author	SHA1	Message	Date
Humble Chirammal	d1c8cce659	Format RBD volumes with `nodiscard` formatting options. Currently rbd CSI plugin uses formatAndMount of mount.SafeFormatAndMount. This does not allow to pass or use specific formatting arguments with it. This patch introduce RBD specific formatting options with both xfs and ext4, for example: -E no-discard with ext4 and -k option with XFS to boost formatting performance of RBD device. Signed-off-by: Humble Chirammal <hchiramm@redhat.com> (cherry picked from commit `0e6617e1ff`)	2019-09-24 14:41:36 +05:30
Madhu Rajanna	1ae09be924	Change the logic of locking if any on going opearation is seen,we have to return Abort error message Signed-off-by: Madhu Rajanna <madhupr007@gmail.com> (cherry picked from commit `6aac399075`)	2019-09-23 12:38:51 +00:00
Poornima G	82b85d536f	Add mount option for Cephfs The storage class already takes MountOptions(MountFlags), these are the bind mount options. Some of these options may not be recognised by the cephfs mount. Hence added a new parameterin Storage Class for - cephfs kernel mount options, - ceph-fuse mount options Ceph kernel mount options are different from ceph-fuse options, hence added two different parameters. Signed-off-by: Poornima G <pgurusid@redhat.com>	2019-09-06 22:03:31 +05:30
Daniel-Pivonka	18e22e500e	Fix liveness connection opening endless sockets Signed-off-by: Daniel-Pivonka <dpivonka@redhat.com>	2019-09-06 09:56:31 +05:30
Madhu Rajanna	b90ddd7ade	Remove volumemounter flag from cephfs Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-09-05 13:04:44 +05:30
Poornima G	90c4d6a451	Cephfs: Use ceph kernel client if kernel version >= 4.17 Ceph kernel client is more performant than ceph fuse client. The kernel client has Quota support only in the kernel version >=4.17. Hence use ceph kernel client when the kernel version is >=4.17. Signed-off-by: Poornima G <pgurusid@redhat.com>	2019-09-05 04:55:05 +00:00
Niels de Vos	dd668e59f1	Address security concerns reported by 'gosec' gosec reports several issues, none of them looks very critical. With this change the following concerns have been addressed: [pkg/cephfs/nodeserver.go:229] - G302: Expect file permissions to be 0600 or less (Confidence: HIGH, Severity: MEDIUM) > os.Chmod(targetPath, 0777) [pkg/cephfs/util.go:39] - G204: Subprocess launched with variable (Confidence: HIGH, Severity: MEDIUM) > exec.Command(program, args...) [pkg/rbd/nodeserver.go:156] - G302: Expect file permissions to be 0600 or less (Confidence: HIGH, Severity: MEDIUM) > os.Chmod(stagingTargetPath, 0777) [pkg/rbd/nodeserver.go:205] - G302: Expect file permissions to be 0600 or less (Confidence: HIGH, Severity: MEDIUM) > os.OpenFile(mountPath, os.O_CREATE\|os.O_RDWR, 0750) [pkg/rbd/rbd_util.go:797] - G304: Potential file inclusion via variable (Confidence: HIGH, Severity: MEDIUM) > ioutil.ReadFile(fPath) [pkg/util/cephcmds.go:35] - G204: Subprocess launched with variable (Confidence: HIGH, Severity: MEDIUM) > exec.Command(program, args...) [pkg/util/credentials.go:47] - G104: Errors unhandled. (Confidence: HIGH, Severity: LOW) > os.Remove(tmpfile.Name()) [pkg/util/credentials.go:92] - G104: Errors unhandled. (Confidence: HIGH, Severity: LOW) > os.Remove(cr.KeyFile) [pkg/util/pidlimit.go:74] - G304: Potential file inclusion via variable (Confidence: HIGH, Severity: MEDIUM) > os.Open(pidsMax) URL: https://github.com/securego/gosec Signed-off-by: Niels de Vos <ndevos@redhat.com>	2019-09-04 11:48:37 +00:00
Madhu Rajanna	a81a3bf96b	implement grpc metrics for ceph-csi Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-30 06:50:32 +00:00
Daniel-Pivonka	01a78cace5	switch to cephfs, utils, and csicommon to new loging system Signed-off-by: Daniel-Pivonka <dpivonka@redhat.com>	2019-08-29 14:04:31 +00:00
Madhu Rajanna	3af364e7b5	move to statand context package Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-26 06:19:24 +00:00
Madhu Rajanna	38ca08bf65	Context based logging for rbd Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-26 06:19:24 +00:00
Daniel-Pivonka	81c28d6cb0	implement klog wrapper Signed-off-by: Daniel-Pivonka <dpivonka@redhat.com>	2019-08-21 14:36:41 +00:00
Daniel-Pivonka	aa74f8c87f	Implement context based logging Signed-off-by: Daniel-Pivonka <dpivonka@redhat.com>	2019-08-21 14:36:41 +00:00
Madhu Rajanna	e557438f87	unmap rbd image if connection timeout. Sometime rbd images are mapped even if the connection timeout error occurs, this will try to unmap if the received error message is connection timeout.This will fix stale maps and rbd image deletion issue Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 10:54:17 +00:00
Madhu Rajanna	0da4bd5151	start controller or node server based on config if both controller and nodeserver flags are set/unset cephcsi will start both server, if only one flag is set, it will start relavent service. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 06:11:43 +00:00
Madhu Rajanna	89732d923f	move flag configuration variable to util remove unwanted checks remove getting drivertype from binary name Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 06:11:43 +00:00
Madhu Rajanna	2b1355061e	rename rbd.go to driver.go to keep code struct similar with cephfs renamed rbd.go to driver.go Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 06:11:43 +00:00
Daniel-Pivonka	d621a58207	prometheus liveness probe sidecar Signed-off-by: Daniel-Pivonka dpivonka@redhat.com	2019-08-13 17:51:41 +00:00
Madhu Rajanna	2ca575b99d	Wrap error if failed to fetch mon This will help user to check whats the actual error. if the config file is having issue or the clusterid is not valid. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-13 17:16:27 +00:00
ShyamsundarR	20d336fca3	Add support to use ceph manager rbd command to delete an image Image deletion takes time proportional to the size of the image. Hence, ceph manager is enhanced to support async deletion of an image, or rather passing the task of deleting an image to the ceph manager. This commit leverages the ceph manager enhancement in the CSI code. NOTE: This is tested against a ceph cluster that is running Ceph master version of the code. Once other releases catch up in terms of the feature, the optimization would be available to the CSI driver as well. Fixes: #523 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-08-13 16:08:22 +00:00
Niels de Vos	31648c8feb	provisioners: add reconfiguring of PID limit The container runtime CRI-O limits the number of PIDs to 1024 by default. When many PVCs are requested at the same time, it is possible for the provisioner to start too many threads (or go routines) and executing 'rbd' commands can start to fail. In case a go routine can not get started, the process panics. The PID limit can be changed by passing an argument to kubelet, but this will affect all pids running on a host. Changing the parameters to kubelet is also not a very elegant solution. Instead, the provisioner pod can change the configuration itself. The pod is running in privileged mode and can write to /sys/fs/cgroup where the limit is configured. With this change, the limit is configured to 'max', just as if there is no limit at all. The logs of the csi-rbdplugin in the provisioner pod will reflect the change it makes when starting the service: $ oc -n rook-ceph logs -c csi-rbdplugin csi-rbdplugin-provisioner-0 .. I0726 13:59:19.737678 1 cephcsi.go:127] Initial PID limit is set to 1024 I0726 13:59:19.737746 1 cephcsi.go:136] Reconfigured PID limit to -1 (max) .. It is possible to pass a different limit on the commandline of the cephcsi executable. The following flag has been added: --pidlimit=<int> the PID limit to configure through cgroups This accepts special values -1 (max) and 0 (default, do not reconfigure). Other integers will be the limit that gets configured in cgroups. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2019-08-13 14:43:29 +00:00
ShyamsundarR	885ec7049d	Update Unstage transaction to undo steps done in Stage In unstage we now adhere to the transaction (or order of steps) done in Stage. To enable this we stash the image meta data into a local file on the staging path for use with unstage request. This helps in unmapping a stale map, in case the mount or other steps in the transaction are complete. Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-08-13 14:07:52 +00:00
ShyamsundarR	44f7b1fe4b	Use "rbd device list" to list and find rbd images and their device paths This change also starts mapping nbd based access using ther rbd CLI as, it is a prerequisite to get device listing for nbd as well. Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-08-13 14:07:52 +00:00
ShyamsundarR	925bda2881	Move mounting staging instance to a sub-path within staging path This commit moves the mounting of a block volumes and filesystems to a sub-file (already the case) or a sub-dir within the staging path. This enables using the staging path to store any additional data regarding the mount. For example, this will be extended in the future to store the fsid of the cluster, and maybe the pool name to map unmap requests to the right image. Also, this fixes the noted hack in the code, to determine in a common manner if there is a mount on the passed in staging path. Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-08-13 14:07:52 +00:00
Madhu Rajanna	7c2fb6187a	remove post validation of rbd device once we map the rbd image on a node we will get the device name its mapped in the map output itself,no need to check the devicepath post rbd mapping Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-07 14:52:13 +00:00
Daniel-Pivonka	0063727199	Make parameter `pool` optional in CephFS storageclass Signed-off-by: Daniel-Pivonka <dpivonka@redhat.com>	2019-08-07 13:30:38 +00:00
Humble Chirammal	0786225937	Implement metrics for RBD plugin Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-08-01 11:58:54 +00:00
Madhu Rajanna	8a7022cc50	Add recover middleware for grpc server Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-01 11:02:27 +00:00
Madhu Rajanna	dfbdec4b6a	add validation to check if stagingPath exists It's CO responsibility to create the stagingPath as per the CSI spec. The CO SHALL ensure // that the path is directory and that the process serving the // request has `read` and `write` permission to that directory. The // CO SHALL be responsible for creating the directory if it does not // exist. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-29 12:52:10 +00:00
Ramana Raja	5af29662b2	cephfs: set the mode of the FS subvolumes ... and not that of the FS subvolume group `csi`. There is no reason for setting the mode of FS subvolume group `csi` (a CephFS subdirectory) as 777. It's default mode is 755. It's sufficient to set the mode of FS subvolumes within the subvolume group to `777`. Signed-off-by: Ramana Raja <rraja@redhat.com>	2019-07-29 10:11:48 +00:00
Ramana Raja	5932fff93e	cephfs: set pool layout of the FS subvolumes ... instead of that of the `csi` subvolume group. The pool layout specified via storage class's `pool` setting is a subvolume property and not a subvolume group property. The `csi` subvolume group may have subvolumes of different storage classes with different pool layouts. Fixes: #499 Signed-off-by: Ramana Raja <rraja@redhat.com>	2019-07-29 10:11:48 +00:00
Humble Devassy Chirammal	c7d990a96b	Merge pull request #460 from Madhu-1/fix-pluginapath Fix pluginpath for cephfs	2019-07-29 14:02:18 +05:30
Madhu Rajanna	d350b13d02	unmap rbd volume if we fail to get devicepath Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-26 16:23:39 +05:30
Madhu Rajanna	5d85f965df	unmap rbd volume if nodestage fails if mapping of rbd device is passed and mounting device to stagingpath fails or if chmod on targetpath fails ,which may leave up stale mapping if unstage is called this will be fixed by unmapping if somthing fails Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-25 16:27:30 +00:00
ShyamsundarR	bd204d7d45	Use --keyfile option to pass keys to all Ceph CLIs Every Ceph CLI that is invoked at present passes the key via the --key option, and hence is exposed to key being displayed on the host using a ps command or such means. This commit addresses this issue by stashing the key in a tmp file, which is again created on a tmpfs (or empty dir backed by memory). Further using such tmp files as arguments to the --keyfile option for every CLI that is invoked. This prevents the key from being visible as part of the argument list of the invoked program on the system. Fixes: #318 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-25 12:46:15 +00:00
Poornima G	c2835183e5	Remove user creation for every volume Currently, provisioner creates user for every volume and nodeplugin uses this user to mount that volume. But nodeplugin and provisioner already have admin credentials, hence using the admin credentials to mount the volume and getting rid of user creation for each volume. Signed-off-by: Poornima G <pgurusid@redhat.com>	2019-07-25 10:59:42 +00:00
Madhu Rajanna	a5164cfa41	Avoid keyring message while logging Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-25 09:48:09 +00:00
Madhu Rajanna	778cfb3090	provide option to set pluginpath for cephfs Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-25 14:47:42 +05:30
Humble Chirammal	561cc26e4c	Implement metrics for CephFS CSI driver Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-07-25 06:03:54 +00:00
Madhu Rajanna	f4c80dec9a	Implement NodeStage and NodeUnstage for rbd in NodeStage RPC call we have to map the device to the node plugin and make sure the the device will be mounted to the global path in nodeUnstage request unmount the device from global path and unmap the device if the volume mode is block we will be creating a file inside a stageTargetPath and it will be considered as the global path Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-24 12:49:21 +00:00
Humble Devassy Chirammal	5d5a6c4d91	Merge pull request #469 from Madhu-1/driver-version Update driver version during build time	2019-07-24 14:41:45 +05:30
ShyamsundarR	e5e332eded	Use correct file descriptor to parse errors File descriptors in use to parse errors from a few command invocations were incorrect. This led to inability to detect certain errors cases and act accordingly. One of the easiest noticeable issues was when an image is deleted but its RADOS keys and maps are still intact. In such cases the DeleteVolume call always errored out unable to find the image rather than, proceed with cleaning up the RADOS objects and returning a success. The original method of using stdout was incorrect, as the command was tested from within a shell script and the scripts STDIN/OUT/ERR was redirected to understand behavior. This is now tested using just the CLI in question, and also examining Ceph code, and further testing a couple of edge conditions by deleting backing images for PVs Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-16 07:51:10 +00:00
Madhu Rajanna	3f8bd3b2a6	Update driver version during build time update driver version and add git commit to the image. This will help us to identify what latest git commit image contains. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-12 15:54:52 +05:30
Poornima G	0d566ee30c	Backward compatibility for deleting and mounting old volumes Signed-off-by: Poornima G <pgurusid@redhat.com>	2019-07-12 05:42:41 +00:00
Poornima G	32ea550e3a	Modify CephFs provisioner to use the ceph mgr commands Currently CephFs provisioner mounts the ceph filesystem and creates a subdirectory as a part of provisioning the volume. Ceph now supports commands to provision fs subvolumes, hance modify the provisioner to use ceph mgr commands to (de)provision fs subvolumes. Signed-off-by: Poornima G <pgurusid@redhat.com>	2019-07-12 05:42:41 +00:00
ShyamsundarR	fa68c35f3b	Support mounting and deleting version 1.0.0 RBD volumes This commit adds support to mount and delete volumes provisioned by older plugin versions (1.0.0) in order to support backward compatibility to 1.0.0 created volumes. It adds back the ability to specify where older meta data was specified, using the metadatastorage option to the plugin. Further, using the provided meta data to mount and delete the older volumes. It also supports a variety of ways in which monitor information may have been specified (in the storage class, or in the secret), to keep the monitor information current. Testing done: - Mount/Delete 1.0.0 plugin created volume with monitors in the StorageClass - Mount/Delete 1.0.0 plugin created volume with monitors in the secret with a key "monitors" - Mount/Delete 1.0.0 plugin created volume with monitors in the secret with a user specified key - PVC creation and deletion with the current version (to ensure at the minimum no broken functionality) - Tested some negative cases, where monitor information is missing in secrets or present with a different key name, to understand if failure scenarios work as expected Updates #378 Follow-up work: - Documentation on how to upgrade to 1.1 plugin and retain above functionality for older volumes Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-08 15:40:17 +00:00
Madhu Rajanna	09f126691c	Add nil check for process Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-08 12:11:53 +00:00
Madhu Rajanna	df2680c50b	support mountflags during nodepublish in rbd Fixes: #292 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-03 21:46:36 +05:30
ShyamsundarR	c4a3675cec	Move locks to more granular locking than CPU count based As detailed in issue #279, current lock scheme has hash buckets that are count of CPUs. This causes a lot of contention when parallel requests are made to the CSI plugin. To reduce lock contention, this commit introduces granular locks per identifier. The commit also changes the timeout for gRPC requests to Create and Delete volumes, as the current timeout is 10s (kubernetes documentation says 15s but code defaults are 10s). A virtual setup takes about 12-15s to complete a request at times, that leads to unwanted retries of the same request, hence the increased timeout to enable operation completion with minimal retries. Tests to create PVCs before and after these changes look like so, Before: Default master code + sidecar provisioner --timeout option set to 30 seconds 20 PVCs Creation: 3 runs, 396/391/400 seconds Deletion: 3 runs, 218/271/118 seconds - Once was stalled for more than 8 minutes and cancelled the run After: Current commit + sidecar provisioner --timeout option set to 30 sec 20 PVCs Creation: 3 runs, 42/59/65 seconds Deletion: 3 runs, 32/32/31 seconds Fixes: #279 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-01 14:10:14 +00:00
ShyamsundarR	bc39c523b7	Fix returning success from DeleteSnapshot for stale requests Also reduced code duplication in fetching pool list from Ceph. DeleteSnapshot like DeleteVolume, should return a success when it detects that the snapshot keys are missing from the RADOS OMaps that store the snapshot UUID to request name mapping. This was missing in the code, and is now added. Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-01 10:54:53 +00:00

1 2 3 4 5 ...

281 Commits