ceph-csi

mirror of https://github.com/ceph/ceph-csi.git synced 2025-05-30 10:36:41 +00:00

Author	SHA1	Message	Date
Prasanna Kumar Kalever	b6a88dd728	rbd: add volume healer Problem: ------- For rbd nbd userspace mounter backends, after a restart of the nodeplugin all the mounts will start seeing IO errors. This is because, for rbd-nbd backends there will be a userspace mount daemon running per volume, post restart of the nodeplugin pod, there is no way to restore the daemons back to life. Solution: -------- The volume healer is a one-time activity that is triggered at the startup time of the rbd nodeplugin. It navigates through the list of volume attachments on the node and acts accordingly. For now, it is limited to nbd type storage only, but it is flexible and can be extended in the future for other backend types as needed. From a few feets above: This solves a severe problem for nbd backed csi volumes. The healer while going through the list of volume attachments on the node, if finds the volume is in attached state and is of type nbd, then it will attempt to fix the rbd-nbd volumes by sending a NodeStageVolume request with the required volume attributes like secrets, device name, image attributes, and etc.. which will finally help start the required rbd-nbd daemons in the nodeplugin csi-rbdplugin container. This will allow reattaching the backend images with the right nbd device, thus allowing the applications to perform IO without any interruptions even after a nodeplugin restart. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	10e4eee481	deploy: add few more cluster-roles for rbd nodeplugin Nodeplugin needs below cluster roles: persistentvolumes: get volumeattachments: list, get These additional permissions are needed by the volume healer. Volume healer aims at fixing the volume health issues at the very startup time of the nodeplugin. As part of its operations, volume healer has to run through the list of volume attachments and understand details about each persistentvolume. The later commits will use these additional cluster roles. Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Prasanna Kumar Kalever	874f6629fb	rbd: get default plugin path Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-07-16 16:30:58 +00:00
Niels de Vos	8662e01d2c	deploy: allow RBD components to get ServiceAccounts The provisioner and node-plugin have the capability to connect to Hashicorp Vault with a ServiceAccount from the Namespace where the PVC is created. This requires permissions to read the contents of the ServiceAccount from an other Namespace than where Ceph-CSI is deployed. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-07-13 17:16:35 +00:00
Yug	a852e66133	deploy: update node-driver-registrar update node-driver-registrar to the latest available release version. Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-25 15:45:14 +00:00
Yug	b19717ef99	deploy: update csi-snapshotter sidecar update csi-snapshotter sidecar to the latest available release version. Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-25 15:45:14 +00:00
Yug	77f2db8875	deploy: update csi-resizer sidecar update csi-resizer sidecar to the latest available release version. Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-25 15:45:14 +00:00
Yug	fac20a9446	deploy: update csi-attacher sidecar update csi-attacher sidecar to the latest available release version. Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-25 15:45:14 +00:00
Yug	cf63be41c8	deploy: update csi-provisioner sidecar update csi-provisioner sidecar to the latest available release version. Signed-off-by: Yug <yuggupta27@gmail.com>	2021-06-25 15:45:14 +00:00
Yati Padia	774e8e4042	util: enable golang profiling Add support for golang profiling. Standard tools like go tool pprof and curl work. example: $ go tool pprof http://localhost:8080/debug/pprof/profile $ go tool pprof http://localhost:8080/debug/pprof/heap $ curl http://localhost:8080/debug/pprof/heap?debug=1 https://golang.org/pkg/net/http/pprof/ contains more details about the pprof interface. Fixes: #1699 Signed-off-by: Yati Padia <ypadia@redhat.com>	2021-05-25 10:41:22 +00:00
Humble Chirammal	1b0ebc43d4	deploy: use serviceAccountName instead of serviceAccount in yamls serviceAccount is the depricated alias for serviceAccountName, so it is recommended/suggested to use serviceAccountName instead. For ex. reference: https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/ Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2021-04-06 09:00:35 +00:00
Madhu Rajanna	fba6a2d0c3	deploy: add csidriver object for cephfs and rbd csidriver object can be created on the kubernetes for below reason. If a CSI driver creates a CSIDriver object, Kubernetes users can easily discover the CSI Drivers installed on their cluster (simply by issuing kubectl get CSIDriver) Ref: https://kubernetes-csi.github.io/docs/csi-driver-object.html#what-is-the-csidriver-object attachRequired is always required to be set to true to avoid issue on RWO PVC. more details about it at https://github.com/rook/rook/pull/4332 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-03-31 13:41:35 +00:00
Prasanna Kumar Kalever	ea4489da68	deploy: bump the snapshotter sidecar image version to v4.0.0 use the latest version of csi-snapshotter sidecar image at the provisioner templates Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2021-03-15 15:11:01 +00:00
Niels de Vos	06d5d8f23a	build: libcephfs-devel is not needed go-ceph does not use CephFS development headers, so there is no need to install libcephfs-devel. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2021-03-10 09:26:00 +00:00
Madhu Rajanna	7835609b06	set priorityclass on provisioner pods set system-cluster-critical priorityclass on provisioner pods. the system-cluster-critical is having lowest priority compared to node-critical. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-02-19 13:29:09 +00:00
Madhu Rajanna	2190ca922e	set priorityclass on plugin pods set system-node-critical priority on the plugin pods, as its the highest priority and this need to be applied on plugin pods as its critical for storage in cluster. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-02-19 13:29:09 +00:00
Madhu Rajanna	e6098520d1	rbd: add configmap get clusterrole for provisioner as provisioner need to get the configmap from different namespace to check tenant configuration. added the clusterrole get access for the same. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2021-02-04 14:58:40 +00:00
Matt Brown	123a26abb3	deploy, helm: enable secret watch in rbac enables secret ''watch'' rbac permission for ceph-csi-rbd-provisioner role. Fixes 1841. Signed-off-by: Matt Brown <matthewbrown18@gmail.com>	2021-01-26 15:41:11 +00:00
Niels de Vos	dd29c6c06b	deploy: allow rbd nodeplugin to read ConfigMaps from Tenants Tenants can have their own ConfigMap that contains connection parameters to the Vault Service where the PV encyption keys are located. It is possible for a Tenant to use a different Vault Service than the one configured by the Storage Admin who deployed Ceph-CSI. For this, the node-plugin needs to be able to read the ConfigMap from the Tenants namespace. See-also: docs/design/proposals/encryption-with-vault-tokens.md Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-16 16:45:29 +00:00
Madhu Rajanna	b3fbcb9c95	rbd: read configuration from the configmap if the kms encryption configmap is not mounted as a volume to the CSI pods, add the code to read the configuration from the kubernetes. Later the code to fetch the configmap will be moved to the new sidecar which is will talk to respective CO to fetch the encryption configurations. The k8s configmap uses the standard vault spefic names to add the configurations. this will be converted back to the CSI configurations. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-16 15:05:05 +00:00
Niels de Vos	cb1899b8c0	deploy: allow rbd nodeplugin to read Secrets from Tenants In order to fetch the Kubernetes Secret with the Vault Token for a Tenant, the ClusterRole needs to allow reading Secrets from all Kubernetes Namespaces (each Tenant has their own Namespace). Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-14 14:45:09 +00:00
Niels de Vos	0e6443e4c1	deploy: add --extra-create-metadata arg to csi-provisioner sidecar This argument in csi-provisioner sidecar allows us to receive pv/pvc name/namespace metadata in the createVolume() request. For ex: csi.storage.k8s.io/pvc/name csi.storage.k8s.io/pvc/namespace csi.storage.k8s.io/pv/name This is a useful information which can be used depend on the use case we have at our driver. The features like vault token enablement for multi tenancy, RBD mirroring ..etc can consume this based on the need. Refer: #1305 Signed-off-by: Humble Chirammal <hchiramm@redhat.com> Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-12-09 13:58:48 +00:00
Madhu Rajanna	518ccf42b3	deploy: add option to set default-fstype in provisioner external-provisioner is exposing a new argument to set the default fstype while starting the provisioner sidecar, if the fstype is not specified in the storageclass the default fstype will be applied for the pvc created from the storageclass. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-08 16:09:25 +00:00
Madhu Rajanna	e3d1ba7703	deploy: set topology=false in provisioner sidecar with csi-provisioner v2.x the topology based provisioning will not have any backward compatibility with older version of kubernetes, if the nodes are not labeled with topology keys, the pvc creation is going to get fail with error `accessibility requirements: no available topology found`, disabling the topology based provisioning by default, if user want to use it he can always enable it. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-12-08 16:09:25 +00:00
Mudit Agarwal	a92d8d7f2e	deploy: update csi sidecar images This PR makes the changes in csi templates and upgrade documentation required for updating csi sidecar images. Signed-off-by: Mudit Agarwal <muagarwa@redhat.com>	2020-12-08 10:23:34 +00:00
Madhu Rajanna	39b1f2b4d3	cleanup: fix mispell words fixed mispell words in the repo. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-29 12:47:46 +05:30
Madhu Rajanna	fc9b2e5ac5	deploy: update deployment template for new controller updated deployment template for the new controller and also added `update` configmap RBAC for the controller as the controller uses the configmap for the leader election. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-11-28 18:50:00 +00:00
Niels de Vos	b26d33b7c1	build: install git as when building from Dockerfile When running a simple build with only the required arguments, the following warning are reported: $ buildah bud --build-arg=BASE_IMAGE=ceph/ceph:v15 --build-arg=GO_ARCH=amd64 -f ./deploy/cephcsi/image/Dockerfile . ... STEP 15: COPY . ${SRC_DIR} STEP 16: RUN make cephcsi cephcsi image settings: quay.io/cephcsi/cephcsi version canary make: git: Command not found make: git: Command not found if [ ! -d ./vendor ]; then (go mod tidy && go mod vendor); fi make: git: Command not found ... STEP 23: COMMIT Getting image source signatures ... Writing manifest to image destination Storing signatures --> 239b19c4049 git is used to detect the current commit, and store it in the binary that is built. Without the commit, the "Git Commit:" in the output is empty, making it impossible to get the exact version: $ podman run --rm 239b19c4049 --version Cephcsi Version: canary Git Commit: Go Version: go1.15 Compiler: gc Platform: linux/amd64 Kernel: 5.8.4-200.fc32.x86_64 Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-10-27 21:46:38 +00:00
Humble Chirammal	e154029e6d	deploy: update csi-attacher to v2.2.0 from v2.1.0 Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-13 12:34:42 +00:00
Humble Chirammal	bae289ea0a	deploy: update sidecar repo paths to new image repository The image repository has been migrated to k8s.gcr.io/sig-storage from quay.io/k8scsi. Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-10-13 12:34:42 +00:00
Nico Berlee	6a5f1380b0	deploy: add pod anti-affinity for provisioner deployments The added anti-affinity rules prevent provisioner operators from scheduling on the same nodes. The kubernetes scheduler will spread the pods across nodes to improve availability during node failures. Signed-off-by: Nico Berlee <nico.berlee@on2it.net>	2020-09-29 09:29:58 +00:00
Madhu Rajanna	f2edc926cf	deploy: remove preStop hook from daemonset templates The lifecycle preStop hook fails on container stop / exit because /bin/sh is not present in the driver registrar container image. the driver-registrar will remove the socket file before stopping. we dont need to have any preStop hook to remove the socket as it was not working as expected Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-31 17:24:54 +00:00
Madhu Rajanna	cc0f0b8a6a	deploy: remove unnecessary aggregate clusterroles The aggregate clusterrole were designed for the scenario where the rules are not completely owned by one component. the aggregate rules can be removed and simplify certain issues around upgrades. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-08-19 09:30:17 +00:00
Prasanna Kumar Kalever	404ee73dcd	cleanup: fix cmd in container img building errmsg Signed-off-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>	2020-08-19 07:06:28 +00:00
Humble Chirammal	53fa00dee8	deploy: update external provisioner version to v1.6.0 from v1.4.0 update helm chart and deploy yaml for version upgrade Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-08-08 00:11:35 +00:00
Humble Chirammal	cd107d433d	deploy: add csi-cephfsplugin provisioner deployment and role Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-08-08 00:11:35 +00:00
Madhu Rajanna	d15ded88f5	cleanup: Remove support for Delete and Unmounting v1.1.0 PVC as v1.0.0 is deprecated we need to remove the support for it in the Next coming (v3.0.0) release. This PR removes the support for the same. closes #882 Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-10 16:07:13 +00:00
Niels de Vos	7affb9289d	ci: display a warning when GO_ARCH is not set for image-cephcsi `make image-cephcsi` will fail when Golang is not installed. There is no strict requirement for Golang to be available, it is only used to gather the architecture of the OS where the image is built. It is possible to build the image successfully with `make image-cephcsi GOARCH=amd64`. In case Golang is not installed, GOARCH can not be detected automatically. This will cause a failure while installing Golang in the container image. Because the failure is not very clear, display a warning in the case the GO_ARCH (from ${GOARCH} in the Makefile) is not set. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-07-06 06:20:53 +00:00
chenxu1990	44d79e3bda	deploy: Missing quotes Missing quotes in deploy/cephfs/kubernetes/csi-provisioner-rbac.yaml Signed-off-by: chenxu1990 <xuchen1990xx@gmail.com>	2020-07-02 09:13:44 +00:00
Madhu Rajanna	9b518726ab	rbd: add hardlimt and softlimit flag added Hardlimit and Softlimit flags for cephcsi arguments. When the Softlimit is reached cephcsi will start a background task to flatten the rbd image and return success and if the hardlimit is reached it will start a background task to flatten the rbd image and return ready to use as false to make sure that the image will not be used until it is flatten. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-07-01 08:21:47 +00:00
Niels de Vos	2055d79165	build: check if installed Golang has a compatible architecture For building a arm64 container image on amd64, it is needed to configure the system specifically for that. In order to prevent including a amd64 executable in a arm64 image, a check has been added. When running an arm64 executable on a amd64 system, an error should occur when cross architecture containers are not supported. This can be triggered when running `make image-cephcsi GO_ARCH=arm64` on a amd64 system. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-06-28 17:46:37 +00:00
Niels de Vos	4fd973b924	build: use BASE_IMAGE from build.env Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-06-28 17:46:37 +00:00
Niels de Vos	f83a065c8a	build: move GOLANG_VERSION to build.env Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-06-28 17:46:37 +00:00
Humble Chirammal	65982a0489	deploy: add `--retry-interval-start` arg for attacher & resizer Considering this parameter is available for other sidecars we should have a parity between the sidecars. Adding it for the same reason Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2020-05-27 15:52:08 +00:00
Niels de Vos	2ad58f152b	build: verify that all dynamically linked libraries are available When building with go-ceph, there are several dynamically linked libraries used directly (libcephfs, librados and librbd), and many more indirectly. By adding an additional RUN statement to check if all libraries are available in the final image, problems related to missing libraries should be caught before publishing/consuming the image. Signed-off-by: Niels de Vos <ndevos@redhat.com>	2020-05-27 09:03:42 +00:00
Madhu Rajanna	a116764f4d	Snapshot: Template changes for snapshot beta Updated the deployment templates and the sidecar images version to support snapshot beta version. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-05-15 08:19:32 +00:00
Yug Gupta	9394ac887e	rbd: change image pull policy unlike other containers, image pull policy of csi-snapshotter was set to "Always", which can be changed to pull only if not present. Signed-off-by: Yug Gupta <ygupta@redhat.com>	2020-05-12 13:44:52 +00:00
Madhu Rajanna	bfa6064b4d	Read baseimage from the dockerfile Updated deploy.sh and build scripts to read base image from the dockerfile Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-22 15:41:40 +00:00
Madhu Rajanna	697ed32778	Add script to build and push multi arch images with the help of qemu-user-static we can run different architecute contains. more info at https://github.com/multiarch/qemu-user-static Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-22 15:41:40 +00:00
Madhu Rajanna	32e2a713e6	Fix multi architecture dockerfile Add support for multi architecture build for cephcsi. with multistage docker build we we build cephcsi binary for both arm64 and amd64 architecture. Signed-off-by: Madhu Rajanna <madhupr007@gmail.com>	2020-04-22 15:41:40 +00:00

1 2 3 4 5

221 Commits