ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-05-24 00:06:41 +00:00

Author	SHA1	Message	Date
Humble Chirammal	65982a0489	deploy: add `--retry-interval-start` arg for attacher & resizer Considering this parameter is available for other sidecars we should have a parity between the sidecars. Adding it for the same reason Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-05-27 15:52:08 +00:00
ShyamsundarR	5c4abf8347	Add topology support to ceph-csi Signed-off-by: ShyamsundarR <srangana@redhat.com>	2020-04-14 14:14:29 +00:00
Madhu Rajanna	58765e27a0	Resizer: Update resizer image version Recently resizer 0.5.0 has been released. This PR updated the resizer container from v0.4.0 to v0.5.0 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-06 12:06:54 +00:00
Madhu Rajanna	bcd646ee55	Deprecate grpc metrics in ceph-csi As kubernetes CSI sidecar is exposing the GRPC mertics we can make use of the same in ceph-csi we dont need to expose our own. update: #881 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-01 11:59:37 +00:00
Wong Hoi Sing Edison	ebe5aa00cf	Upgrade: csi-node-driver-registrar from v1.2.0 to v1.3.0 See https://github.com/kubernetes-csi/node-driver-registrar/releases/tag/v1.3.0 See https://github.com/kubernetes-csi/node-driver-registrar/blob/v1.3.0/CHANGELOG-1.3.md	2020-04-01 08:39:37 +00:00
Humble Chirammal	8265c431a7	Bring attacher controllers to latest version Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-03-20 11:09:05 +00:00
Madhu Rajanna	d02dfe2dfe	Remove unwanted RBAC rules from ceph-csi There are currently unwanted RBAC permission is given for ceph-csi, This PR reduces removes such unwanted RBAC resources. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-02-13 21:36:27 +00:00
Madhu Rajanna	034b123478	Remove mount cache for cephfs PR #282 introduces the mount cache to solve cephfs fuse mount issue when cephfs plugin pod restarts .This is not working as intended. This PR removes the code for maintainability. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-02-11 15:11:21 +00:00
Madhu Rajanna	eb2fb9233b	Add run hostpath to daemonset pods `/run/mount` need to be share between host and csi-plugin containers for `/run/mount/utab` this is required to ensures that the network is not stopped prior to unmounting the network devices. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-01-28 16:50:18 +00:00
Oguz Kilcan	aadce54b2f	Added PodSecurityPolicy support	2020-01-22 08:19:42 +00:00
wilmardo	f04af5742d	refact: Remove Kubernetes 1.13.x support Signed-off-by: wilmardo <info@wilmardenouden.nl>	2020-01-20 10:32:30 +00:00
Madhu Rajanna	e0cc7740f6	CSI: run all containers as privileged in daemonset pods On systems with SELinux enabled, non-privileged containers can't access data of privileged containers. Since the socket is exposed by privileged containers, all sidecars must be privileged too. This is needed only for containers running in daemonset as we are using bidirectional mounts in daemonset Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-01-13 13:21:29 +00:00
Madhu Rajanna	fbda8cc4ca	Use EmptyDir to store provisioner socket currently, we are making use of host path directory to store the provisioner socket, as this the socket is not needed by anyone else other than containers inside the provisioner pod using the empty directory to store this socket is the best option. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-01-13 13:21:29 +00:00
Wong Hoi Sing Edison	543360ee00	Upgrade: csi-attacher from v1.2.0 to v2.1.0 See https://github.com/kubernetes-csi/external-attacher/releases/tag/v2.1.0 See https://github.com/kubernetes-csi/external-attacher/blob/v2.1.0/CHANGELOG-2.1.md	2020-01-07 14:27:29 +00:00
Wong Hoi Sing Edison	f37bdfdd44	Upgrade: csi-node-driver-registrar from v1.1.0 to v1.2.0 See https://github.com/kubernetes-csi/node-driver-registrar/releases/tag/v1.2.0 See https://github.com/kubernetes-csi/node-driver-registrar/blob/v1.2.0/CHANGELOG-1.2.md	2020-01-06 07:48:41 +00:00
Wong Hoi Sing Edison	74cb18bd28	Upgrade: csi-resizer from v0.3.0 to v0.4.0 See https://github.com/kubernetes-csi/external-resizer/releases/tag/v0.4.0 See https://github.com/kubernetes-csi/external-resizer/blob/v0.4.0/CHANGELOG-0.4.md	2020-01-05 07:21:12 +00:00
Wong Hoi Sing Edison	3e656769b7	Update csi-provisioner from v1.3.0 to v1.4.0 See https://github.com/kubernetes-csi/external-provisioner/releases/tag/v1.4.0 See https://github.com/kubernetes-csi/external-provisioner/blob/v1.4.0/CHANGELOG-1.4.md	2020-01-02 15:53:07 +00:00
Madhu Rajanna	b849b7daaa	Fix leader election flag in deployment files Fixes: https://github.com/ceph/ceph-csi/issues/748 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-12-17 12:19:01 +00:00
Humble Chirammal	671e2d814a	Add volumesize roundoff for expandrequest Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-11-27 14:00:47 +00:00
Humble Chirammal	ac09c5553c	Add E2E for cephfs resize functionality Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-11-27 14:00:47 +00:00
Madhu Rajanna	9287948991	update registration directory name updated cephfs registration directory name to match with rbd implementaion Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-10-10 07:16:09 +00:00
Humble Chirammal	1efdf14ac5	At present, the request timeout of sidecars are at the 60s and this is a request to increase this time out value to 150s or higher. The higher timeout value can help to reduce the load of our backend ceph cluster and also can avoid throttling issues at sidecars to an extent. Fix# #602 Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-10-09 05:28:40 +00:00
Daniel-Pivonka	cd52798a51	Change default csi liveness ports to ones less common Signed-off-by: Daniel-Pivonka <dpivonka@redhat.com>	2019-10-01 15:08:58 +00:00
wilmardo	6ee381db3a	refactor: Merge 1.13 and 1.14 Helm charts and improve charts Signed-off-by: wilmardo <info@wilmardenouden.nl>	2019-09-27 05:49:18 +00:00
Madhu Rajanna	e2890a27ff	connect to provisioner socket Fixes: #619 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-09-20 08:13:19 +00:00
Madhu Rajanna	a81a3bf96b	implement grpc metrics for ceph-csi Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-30 06:50:32 +00:00
wilmardo	3111e7712a	feat: Adds Ceph logo as icon for Helm charts Signed-off-by: wilmardo <info@wilmardenouden.nl>	2019-08-20 05:34:28 +00:00
Madhu Rajanna	0da4bd5151	start controller or node server based on config if both controller and nodeserver flags are set/unset cephcsi will start both server, if only one flag is set, it will start relavent service. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-19 06:11:43 +00:00
wilmardo	0a90762970	fix: Adds liveness sidecar to v1.14+ helm charts Signed-off-by: wilmardo <info@wilmardenouden.nl>	2019-08-16 08:38:49 +00:00
wilmardo	30fb7de118	feat: Implement helm lint Signed-off-by: wilmardo <info@wilmardenouden.nl>	2019-08-16 07:38:33 +00:00
Daniel-Pivonka	d621a58207	prometheus liveness probe sidecar Signed-off-by: Daniel-Pivonka dpivonka@redhat.com	2019-08-13 17:51:41 +00:00
wilmardo	cba6115e30	Fix 1.13 charts Signed-off-by: wilmardo <info@wilmardenouden.nl>	2019-08-13 16:42:15 +00:00
wilmardo	ca5fbc180c	Rework of helm charts Signed-off-by: wilmardo <info@wilmardenouden.nl>	2019-08-13 16:42:15 +00:00
Niels de Vos	31648c8feb	provisioners: add reconfiguring of PID limit The container runtime CRI-O limits the number of PIDs to 1024 by default. When many PVCs are requested at the same time, it is possible for the provisioner to start too many threads (or go routines) and executing 'rbd' commands can start to fail. In case a go routine can not get started, the process panics. The PID limit can be changed by passing an argument to kubelet, but this will affect all pids running on a host. Changing the parameters to kubelet is also not a very elegant solution. Instead, the provisioner pod can change the configuration itself. The pod is running in privileged mode and can write to /sys/fs/cgroup where the limit is configured. With this change, the limit is configured to 'max', just as if there is no limit at all. The logs of the csi-rbdplugin in the provisioner pod will reflect the change it makes when starting the service: $ oc -n rook-ceph logs -c csi-rbdplugin csi-rbdplugin-provisioner-0 .. I0726 13:59:19.737678 1 cephcsi.go:127] Initial PID limit is set to 1024 I0726 13:59:19.737746 1 cephcsi.go:136] Reconfigured PID limit to -1 (max) .. It is possible to pass a different limit on the commandline of the cephcsi executable. The following flag has been added: --pidlimit=<int> the PID limit to configure through cgroups This accepts special values -1 (max) and 0 (default, do not reconfigure). Other integers will be the limit that gets configured in cgroups. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2019-08-13 14:43:29 +00:00
ShyamsundarR	44f7b1fe4b	Use "rbd device list" to list and find rbd images and their device paths This change also starts mapping nbd based access using ther rbd CLI as, it is a prerequisite to get device listing for nbd as well. Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-08-13 14:07:52 +00:00
Madhu Rajanna	02bcb5f16a	Enable leader election in v1.14+ Use Deployment with leader election instead of StatefulSet Deployment behaves better when a node gets disconnected from the rest of the cluster - new provisioner leader is elected in ~15 seconds, while it may take up to 5 minutes for StatefulSet to start a new replica. Refer: kubernetes-csi/external-provisioner@52d1fbc Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-08-05 07:11:44 +00:00
ShyamsundarR	bd204d7d45	Use --keyfile option to pass keys to all Ceph CLIs Every Ceph CLI that is invoked at present passes the key via the --key option, and hence is exposed to key being displayed on the host using a ps command or such means. This commit addresses this issue by stashing the key in a tmp file, which is again created on a tmpfs (or empty dir backed by memory). Further using such tmp files as arguments to the --keyfile option for every CLI that is invoked. This prevents the key from being visible as part of the argument list of the invoked program on the system. Fixes: #318 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-25 12:46:15 +00:00
Madhu Rajanna	f4c80dec9a	Implement NodeStage and NodeUnstage for rbd in NodeStage RPC call we have to map the device to the node plugin and make sure the the device will be mounted to the global path in nodeUnstage request unmount the device from global path and unmap the device if the volume mode is block we will be creating a file inside a stageTargetPath and it will be considered as the global path Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-07-24 12:49:21 +00:00
ShyamsundarR	c4a3675cec	Move locks to more granular locking than CPU count based As detailed in issue #279, current lock scheme has hash buckets that are count of CPUs. This causes a lot of contention when parallel requests are made to the CSI plugin. To reduce lock contention, this commit introduces granular locks per identifier. The commit also changes the timeout for gRPC requests to Create and Delete volumes, as the current timeout is 10s (kubernetes documentation says 15s but code defaults are 10s). A virtual setup takes about 12-15s to complete a request at times, that leads to unwanted retries of the same request, hence the increased timeout to enable operation completion with minimal retries. Tests to create PVCs before and after these changes look like so, Before: Default master code + sidecar provisioner --timeout option set to 30 seconds 20 PVCs Creation: 3 runs, 396/391/400 seconds Deletion: 3 runs, 218/271/118 seconds - Once was stalled for more than 8 minutes and cancelled the run After: Current commit + sidecar provisioner --timeout option set to 30 sec 20 PVCs Creation: 3 runs, 42/59/65 seconds Deletion: 3 runs, 32/32/31 seconds Fixes: #279 Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-07-01 14:10:14 +00:00
Humble Chirammal	027331c186	Use sidecar which support cloning Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-06-28 01:11:06 +00:00
Madhu Rajanna	59d3365d3b	update statefulset and daemonset api-version Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-25 14:00:46 +00:00
Madhu Rajanna	983f28ad2f	Revert "Use Deployment with leader election instead of StatefulSet" This reverts commit a151bec94b93f0f8ddbe50b572bcbe94e4c344ae. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-14 13:39:03 +00:00
Madhu Rajanna	a151bec94b	Use Deployment with leader election instead of StatefulSet Deployment behaves better when a node gets disconnected from the rest of the cluster - new provisioner leader is elected in ~15 seconds, while it may take up to 5 minutes for StatefulSet to start a new replica. Refer: `52d1fbcf9d` Fixes: #335 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-06-10 09:51:22 +05:30
Humble Devassy Chirammal	95252dd9f6	Merge pull request #390 from ShyamsundarR/stateless-cephfs Make CephFS plugin stateless reusing RADOS based journal scheme	2019-06-07 10:44:18 +05:30
Humble Chirammal	45ae1c56e4	Promote sidecars to latest available version tags. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2019-06-02 15:01:34 +05:30
ShyamsundarR	b9cd0e18ad	Make CephFS plugin stateless reusing RADOS based journal scheme This is a part of the stateless set of commits for CephCSI. This commit removes the dependency on config maps to store cephFS provisioned volumes, and instead relies on RADOS based objects and keys, and required CSI VolumeID encoding to detect the provisioned volumes. Changes: - Provide backward compatibility to provisioned volumes by older plugin versions (1.0.0 or older) - Remove Create/Delete support for statically provisioned volumes (fixes #382) - Added namespace support to RADOS OMaps and used the same to store RADOS CSI objects and keys in the CephFS metadata pool - Added support to mention fsname for CephFS provisioning (fixes #359) - Changed field name in CSI Identifier to 'location', to denote a pool or fscid - Updated mounter cache to use new scheme - Required Helm manifests are updated - Required documentation and other manifests are updated - Made driver option 'metadatastorage' as optional, as fresh installs do not need to specify the same Testing done: - Create/Mount/Delete PVC - Create/Delete 5 PVCs - Mount version 1.0.0 PVC - Delete version 1.0.0 PV - Mount Statically defined PV/PVC/Pod - Mount Statically defined version 1.0.0 PV/PVC/Pod - Delete Statically defined version 1.0.0 PV/PVC/Pod - Node restart when mounted to test mountcache - Use InstanceID other than 'default' - RBD basic round of tests, as namespace is added to OMaps - csitest against ceph-fs plugin - NOTE: CephFS plugin still does not detect and address already created volumes but of a different size - Test not providing any value to the metadata storage parameter Signed-off-by: ShyamsundarR <srangana@redhat.com>	2019-05-30 06:20:35 -04:00
Madhu Rajanna	2d560ba087	update ceph-csi to build and use a single docker image currently, we have 3 docker files(cephcsi,rbd,cephfs) in the ceph-csi repo. [commit ](`85e121ebfe`) added by John to build a single image which can act as rbd or cephfs based on the input configuration. This PR updates the makefile and kubernetes templates to use the unified image and also its deletes the other two dockerfiles. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-05-28 18:10:22 +00:00
Kaushal M	63d00afb28	deploy: Use aggregated ClusterRoles The kubernetes manifests and Helm templates have been updated to use aggregated ClusterRoles. The same change has been done in Rook as well. Refer rook/rook#2634 and rook/rook#2975 Signed-off-by: Kaushal M <kshlmster@gmail.com>	2019-04-17 11:15:08 +05:30
Madhu Rajanna	e4d830a2c2	remove extra node rules in provisioner Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-04-04 11:11:29 +05:30
Madhu Rajanna	54d52bb411	update attacher endpoint Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2019-04-04 11:11:29 +05:30

1 2 3

141 Commits