Job Summary
Kubernetes & OpenShift Administration
Install, configure, upgrade, and manage OpenShift Container Platform (OCP) using IPI/UPI methods.
Administer and maintain Kubernetes and OpenShift clusters for high availability and performance.
Manage Operators, Custom Resource Definitions (CRDs), and cluster scaling.
Perform periodic upgrades aligned with Red Hat releases and SLAs.
Troubleshoot cluster, infrastructure, and application-level issues.
Implement security best practices and RBAC policies within OpenShift.

Automation, Monitoring & CI/CD
Automate administration tasks using Ansible and shell scripting.
Support and integrate CI/CD pipelines (Jenkins/OpenShift Pipelines or similar).
Implement and maintain monitoring, logging, and alerting solutions:
Prometheus, Grafana, Loki
ELK Stack (Elasticsearch, Logstash, Kibana)
Optimize cluster and application performance.
Key Responsibilities
Strong hands-on experience with Linux (RHEL) administration.
Proven experience with Kubernetes and OpenShift administration.
Experience with Red Hat Satellite, patching, and system hardening.
Solid understanding of the Linux boot process, storage, networking, and security.
Skill Requirements
Linux Administration
Administer and support Red Hat Enterprise Linux (RHEL) 6/7/8/9 environments.
Perform OS patching and lifecycle management using Red Hat Satellite, YUM, and third-party repositories.
Handle Linux OS recovery, including bare-metal recovery from backups.
Troubleshoot boot issues, kernel panics, LVM, filesystem, and performance issues.
Manage NFS/CIFS, storage integrations (SAN/NAS), and Oracle disk mappings with udev rules.
Implement and troubleshoot SSH (key-based authentication), PAM, LDAP/IPA/IDM integrations.
Monitor server performance using tools like top, sar, vmstat, and system logs.
Follow ITIL/ITSM processes, SLAs, DRP, and BCP requirements.
Other Requirements
Collaboration & Documentation
Work closely with development teams to support deployments and operations.