ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-05-27 17:26:42 +00:00

Author	SHA1	Message	Date
Niels de Vos	31648c8feb	provisioners: add reconfiguring of PID limit The container runtime CRI-O limits the number of PIDs to 1024 by default. When many PVCs are requested at the same time, it is possible for the provisioner to start too many threads (or go routines) and executing 'rbd' commands can start to fail. In case a go routine can not get started, the process panics. The PID limit can be changed by passing an argument to kubelet, but this will affect all pids running on a host. Changing the parameters to kubelet is also not a very elegant solution. Instead, the provisioner pod can change the configuration itself. The pod is running in privileged mode and can write to /sys/fs/cgroup where the limit is configured. With this change, the limit is configured to 'max', just as if there is no limit at all. The logs of the csi-rbdplugin in the provisioner pod will reflect the change it makes when starting the service: $ oc -n rook-ceph logs -c csi-rbdplugin csi-rbdplugin-provisioner-0 .. I0726 13:59:19.737678 1 cephcsi.go:127] Initial PID limit is set to 1024 I0726 13:59:19.737746 1 cephcsi.go:136] Reconfigured PID limit to -1 (max) .. It is possible to pass a different limit on the commandline of the cephcsi executable. The following flag has been added: --pidlimit=<int> the PID limit to configure through cgroups This accepts special values -1 (max) and 0 (default, do not reconfigure). Other integers will be the limit that gets configured in cgroups. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2019-08-13 14:43:29 +00:00
ShyamsundarR	44f7b1fe4b	Use "rbd device list" to list and find rbd images and their device paths This change also starts mapping nbd based access using ther rbd CLI as, it is a prerequisite to get device listing for nbd as well. Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-08-13 14:07:52 +00:00
Madhu Rajanna	02bcb5f16a	Enable leader election in v1.14+ Use Deployment with leader election instead of StatefulSet Deployment behaves better when a node gets disconnected from the rest of the cluster - new provisioner leader is elected in ~15 seconds, while it may take up to 5 minutes for StatefulSet to start a new replica. Refer: kubernetes-csi/external-provisioner@52d1fbc Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-05 07:11:44 +00:00
ShyamsundarR	bd204d7d45	Use --keyfile option to pass keys to all Ceph CLIs Every Ceph CLI that is invoked at present passes the key via the --key option, and hence is exposed to key being displayed on the host using a ps command or such means. This commit addresses this issue by stashing the key in a tmp file, which is again created on a tmpfs (or empty dir backed by memory). Further using such tmp files as arguments to the --keyfile option for every CLI that is invoked. This prevents the key from being visible as part of the argument list of the invoked program on the system. Fixes: #318 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-25 12:46:15 +00:00
Madhu Rajanna	f4c80dec9a	Implement NodeStage and NodeUnstage for rbd in NodeStage RPC call we have to map the device to the node plugin and make sure the the device will be mounted to the global path in nodeUnstage request unmount the device from global path and unmap the device if the volume mode is block we will be creating a file inside a stageTargetPath and it will be considered as the global path Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-24 12:49:21 +00:00
ShyamsundarR	c4a3675cec	Move locks to more granular locking than CPU count based As detailed in issue #279, current lock scheme has hash buckets that are count of CPUs. This causes a lot of contention when parallel requests are made to the CSI plugin. To reduce lock contention, this commit introduces granular locks per identifier. The commit also changes the timeout for gRPC requests to Create and Delete volumes, as the current timeout is 10s (kubernetes documentation says 15s but code defaults are 10s). A virtual setup takes about 12-15s to complete a request at times, that leads to unwanted retries of the same request, hence the increased timeout to enable operation completion with minimal retries. Tests to create PVCs before and after these changes look like so, Before: Default master code + sidecar provisioner --timeout option set to 30 seconds 20 PVCs Creation: 3 runs, 396/391/400 seconds Deletion: 3 runs, 218/271/118 seconds - Once was stalled for more than 8 minutes and cancelled the run After: Current commit + sidecar provisioner --timeout option set to 30 sec 20 PVCs Creation: 3 runs, 42/59/65 seconds Deletion: 3 runs, 32/32/31 seconds Fixes: #279 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-01 14:10:14 +00:00
Humble Chirammal	027331c186	Use sidecar which support cloning Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-06-28 01:11:06 +00:00
Madhu Rajanna	59d3365d3b	update statefulset and daemonset api-version Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-25 14:00:46 +00:00
Madhu Rajanna	983f28ad2f	Revert "Use Deployment with leader election instead of StatefulSet" This reverts commit a151bec94b93f0f8ddbe50b572bcbe94e4c344ae. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-14 13:39:03 +00:00
Madhu Rajanna	a151bec94b	Use Deployment with leader election instead of StatefulSet Deployment behaves better when a node gets disconnected from the rest of the cluster - new provisioner leader is elected in ~15 seconds, while it may take up to 5 minutes for StatefulSet to start a new replica. Refer: `52d1fbcf9d` Fixes: #335 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-10 09:51:22 +05:30
Humble Chirammal	45ae1c56e4	Promote sidecars to latest available version tags. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-06-02 15:01:34 +05:30
Madhu Rajanna	2d560ba087	update ceph-csi to build and use a single docker image currently, we have 3 docker files(cephcsi,rbd,cephfs) in the ceph-csi repo. [commit ](`85e121ebfe`) added by John to build a single image which can act as rbd or cephfs based on the input configuration. This PR updates the makefile and kubernetes templates to use the unified image and also its deletes the other two dockerfiles. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-05-28 18:10:22 +00:00
ShyamsundarR	d02e50aa9b	Removed config maps and replaced with rados omaps Existing config maps are now replaced with rados omaps that help store information regarding the requested volume names and the rbd image names backing the same. Further to detect cluster, pool and which image a volume ID refers to, changes to volume ID encoding has been done as per provided design specification in the stateless ceph-csi proposal. Additional changes and updates, - Updated documentation - Updated manifests - Updated Helm chart - Addressed a few csi-test failures Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-05-19 12:29:33 +00:00
Kaushal M	63d00afb28	deploy: Use aggregated ClusterRoles The kubernetes manifests and Helm templates have been updated to use aggregated ClusterRoles. The same change has been done in Rook as well. Refer rook/rook#2634 and rook/rook#2975 Signed-off-by: Kaushal M <kshlmster@gmail.com>	2019-04-17 11:15:08 +05:30
Yuxiang Zhu	35c55aeb68	add missing PV update permission for rbd attacher PR #290 missed the update permission to persistentvolumes. Without that permission, you will get the following error when attaching a RBD volume to a pod: ``` Warning FailedAttachVolume 100s (x11 over 7m52s) attachdetach-controller AttachVolume.Attach failed for volume "pvc-d23f8745-60bb-11e9-bd35-5254001c78d6" : could not add PersistentVolume finalizer: persistentvolumes "pvc-d23f8745-60bb-11e9-bd35-5254001c78d6" is forbidden: User "system:serviceaccount:kube-system:rbd-csi-provisioner" cannot update resource "persistentvolumes" in API group "" at the cluster scope ```	2019-04-17 11:16:43 +08:00
Madhu Rajanna	e4d830a2c2	remove extra node rules in provisioner Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-04-04 11:11:29 +05:30
Madhu Rajanna	54d52bb411	update attacher endpoint Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-04-04 11:11:29 +05:30
Madhu Rajanna	5c600a1bc5	update rbd helm chats to deploy attacher as sidecar container in provisioner pod Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-04-04 11:11:29 +05:30
Madhu Rajanna	3ef11e06c3	deploy attacher sidecar in rbd provisioner sts currently we are deploying external-attacher as a seperate statefulset, which leads to attacher communicating with the node provisoner daemonset, This PR deploys external-attacher as a sidecar container inside provisioner statefulset, so that external-provisioner always communicates with the plugin responsible for the provision controller capcabilities. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-04-04 11:11:29 +05:30
王怀宗	4228ceb51e	rbd deploy csi-provisioner rbac add resources `nodes` get, list, watch #293	2019-04-01 16:48:30 +00:00
ShyamsundarR	2064e674a4	Addressed using k8s client APIs to fetch secrets Based on the review comments addressed the following, - Moved away from having to update the pod with volumes when a new Ceph cluster is added for provisioning via the CSI driver - The above now used k8s APIs to fetch secrets - TBD: Need to add a watch mechanisim such that these secrets can be cached and updated when changed - Folded the Cephc configuration and ID/key config map and secrets into a single secret - Provided the ability to read the same config via mapped or created files within the pod Tests: - Ran PV creation/deletion/attach/use using new scheme StorageClass - Ran PV creation/deletion/attach/use using older scheme to ensure nothing is broken - Did not execute snapshot related tests Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-03-26 16:19:24 +00:00
Madhu Rajanna	52397b4dc4	rename socket directory to a common name as the socket directory will be created inside the container no need to follow the plugin name in for the directory creation, this will also reduce the code changes if we want to change driver name. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-03-22 09:58:21 +05:30
Madhu Rajanna	d61a87b42e	Fix driver name as per CSI spec Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-03-13 12:04:30 +05:30
Madhu Rajanna	c0745486a7	add event rules for provisioner Fixes: #https://github.com/ceph/ceph-csi/pull/234#issuecomment-468967752 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-03-04 14:34:14 +00:00
Madhu Rajanna	2ab1f3e82d	add csinodeinfos rules Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-02-27 19:32:07 +05:30
Madhu Rajanna	f4a0726226	Fix rbac issue in rbd plugin remove unwanted rules and update rbac to have permission to modify endpoints and configmaps in the current namespace. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-02-27 16:45:34 +05:30
Madhu Rajanna	cee9c4f8b2	Fix yamllint issues Signed-off-by: Madhu Rajanna <mrajanna@redhat.com>	2019-02-07 12:19:14 +00:00
Madhu Rajanna	9ba501617d	update sidecar images to stable version (v1.0.1) Fixes: #157 Signed-off-by: Madhu Rajanna <mrajanna@redhat.com>	2019-02-04 15:37:28 +05:30
Madhu Rajanna	5b512cd48c	Add snapshot yaml files and volume clone capabilities to provisioner. Signed-off-by: Madhu Rajanna <mrajanna@redhat.com>	2019-01-28 10:26:57 +05:30
Huamin Chen	e0e764b3a1	review feedback: tune rbd provisioner rbac Signed-off-by: Huamin Chen <hchen@redhat.com>	2019-01-23 10:05:15 -05:00
Huamin Chen	c6c496ff59	switch to node registrar	2019-01-22 14:46:41 -05:00
Huamin Chen	48407e2484	add csi volume device mount path to csi plugin Signed-off-by: Huamin Chen <hchen@redhat.com>	2019-01-17 08:57:18 -05:00
Huamin Chen	263c45bb45	enable csi block; use canary external-provisioner image to pick up block volume provisioning Signed-off-by: Huamin Chen <hchen@redhat.com>	2019-01-16 13:52:45 -05:00
Masaki Kimura	165b82a44c	Add block supports to rbd driver	2019-01-16 12:49:02 -05:00
Huamin Chen	aed7506d88	fix merge leftovers; use canary driver-registrar image, as v1.0.0 is not hosted in quay.io Signed-off-by: Huamin Chen <hchen@redhat.com>	2019-01-15 13:31:06 -05:00
Huamin Chen	85b8415024	Merge branch 'master' into master-to-1.0	2019-01-15 16:15:30 +00:00
mickymiek	b23ee70d7f	fix rbac rules for configmaps	2019-01-14 20:15:09 +00:00
mickymiek	7d47bb0698	make k8s_configmap default metadatastorage for k8s deployments	2019-01-14 20:15:09 +00:00
mickymiek	62d65ad0cb	cm metadata persist for rbd and cephfs	2019-01-14 20:15:09 +00:00
Mike Cronce	d5c6f889c5	deploy/rbd/kubernetes: Use CSI 1.x plugin directory	2018-12-04 15:38:16 -05:00
Mike Cronce	c552b24c49	deploy/rbd: Updated all image tags from v0.3.0 to v1.0.0	2018-11-29 13:16:25 -05:00
Huamin Chen	3436a094f7	support nsmounter when running in containerized mode Signed-off-by: Huamin Chen <hchen@redhat.com>	2018-10-15 14:59:41 +00:00
Huamin Chen	4453cfce5b	set dns policy in csi plugin so storage class can use mons' FQDN Signed-off-by: Huamin Chen <hchen@redhat.com>	2018-09-19 14:39:43 +00:00
Huamin Chen	8955eb03bc	support rbd-nbd Signed-off-by: Huamin Chen <hchen@redhat.com>	2018-09-17 18:12:22 +00:00
Masaki Kimura	02fdf238b0	Add configurations to handle kubelet-plugin-watcher to sample yaml files Fixes: #73	2018-09-10 19:16:17 +00:00
gman	e2910f1c18	deployment update for 0.3.0	2018-08-07 15:11:22 +02:00
Seungcheol Ko	bc34bd389e	support image features for csi-rbdplugin	2018-07-21 00:59:54 +09:00
chun wang	c0847ce868	fix CSI plugin pvc.yaml file storageClassName Error Signed-off-by: chunwang Lin <q60563@gmail.com>	2018-04-26 13:32:24 +08:00
gman	99bdbf2182	Merge branch 'master' into wip-cephfs	2018-03-13 11:21:34 +01:00
gman	1c1b0eab1e	WIP cephfs CSI plugin	2018-03-05 13:21:30 +01:00

1 2

100 Commits