Job Summary
Dynatrace Expert
Responsibilities of Role:
•Deploy, configure, and manage Dynatrace platform for full-stack monitoring (infrastructure, Application ,end user experience)
•Configure application performance monitoring (APM) and distributed tracing
• Set up real-time monitoring dashboards and service flow visualization.
• Define and manage alerting policies using Dynatrace AI (Davis) for anomaly detection.
• Integrate Dynatrace with CI/CD pipelines and ITSM tools (ServiceNow).
•Monitor microservices, containers, and cloud-native workloads
•Perform root cause analysis using Smart scape topology and AI engine
•Automate deployments using APIs, scripts, and Infrastructure-as-Code tools.
• Configure SLOs, SLIs, and synthetic monitoring for user journeys.
• Support and train teams on Dynatrace usage and best practices Terraform Expert Responsibilities of Role:
•Design and implement Infrastructure as Code (IaC) using Terraform for cloud and on-prem environments.
•Develop reusable Terraform modules for scalable infrastructure deployments.
•Manage provisioning of cloud resources across AWS, Azure, and GCP.
•Implement Terraform state management (remote backends like S3, Azure Storage).
•Integrate Terraform with CI/CD pipelines for automated deployments.
•Ensure infrastructure versioning and change tracking using Git.
•Implement security best practices and compliance policies in Terraform code.
•Perform infrastructure cost optimization and resource lifecycle management.
•Troubleshoot and resolve infrastructure provisioning issues.
•Collaborate with DevOps and Cloud teams for automation initiatives.
•Automate infrastructure provisioning using Terraform and orchestration tools.
•Maintain proper documentation and governance of IaC standards. Ansible Expert Responsibilities of Role:
•Design, develop, and maintai
PRTG Expert
Responsibilities of Role:
•Install, configure, and manage PRTG Network Monitor environments.
•Monitor network devices, servers, applications, and bandwidth usage.
•Configure sensors (SNMP, WMI, HTTP, Flow, etc.) for various monitoring needs.
•Design dashboards and maps for real-time infrastructure visibility.
•Configure alerts, notifications, and escalation policies.
•Optimize performance and scalability of PRTG monitoring systems.
•Perform root cause analysis and incident troubleshooting.
•Integrate PRTG with third-party systems and APIs.
•Automate monitoring setup and maintenance tasks.
•Manage distributed monitoring using remote probes.
•Conduct regular health checks and system upgrades.
Nagios Expert
Responsibilities of Role:
•Install, configure, and maintain Nagios Core / Nagios XI monitoring systems.
•Monitor infrastructure, applications, servers, and network devices.
•Configure hosts, services, plugins, and custom checks.
•Develop and manage alerting and notification mechanisms.
•Create dashboards and reports for system health monitoring.
•Integrate Nagios with third-party tools and scripts.
•Develop custom plugins using Bash, Python, or Perl.
•Perform root cause analysis for detected issues.
•Ensure high availability and performance of monitoring systems.
•Automate monitoring configurations and maintenance tasks.
•Upgrade and patch Nagios environments
Key Responsibilities
PowerShell Expert - Responsibilities of Role:
•Develop, maintain, and optimize PowerShell scripts for automation of IT operations and administrative tasks.
•Automate infrastructure management, system configuration, and deployment processes.
•Create scripts for user management, system monitoring, and application maintenance.
•Integrate PowerShell automation with cloud platforms (Azure, AWS).
•Develop reusable script modules and maintain script repositories.
•Automate patch management, backups, and system health checks.
•Troubleshoot and debug scripts to resolve operational issues.
•Integrate PowerShell with APIs, REST services, and third-party tools.
•Work with DevOps teams to embed automation into CI/CD pipelines.
•Manage Windows Server environments using PowerShell and DSC (Desired State Configuration).
•Implement security best practices including credential management and secure scripting.
•Document automation processes and provide training to teams
Ansible Expert
Responsibilities of Role:
•Design, develop, and maintain Ansible playbooks, roles, and collections for automated provisioning and configuration.
•Implement Infrastructure as Code (IaC) practices using Ansible in hybrid cloud or on-premises environments.
•Automate repetitive tasks, reducing manual errors and deployment times across development, staging, and production systems.
•Manage patching, system updates, and application deployments using Ansible automation.
•Collaborate with development, security, and operations teams to implement consistent configuration standards.
•Integrate Ansible with CI/CD pipelines (GitLabCI, Jenkins, Azure DevOps, etc.) for automated deployment workflows.
•Maintain and optimize Ansible Tower / AWX for centralized job execution and inventory management.
•Document infrastructure automation processes and provide training to internal teams.
Terraform Expert
Responsibilities of Role:
•Design and implement Infrastructure as Code (IaC) using Terraform for cloud and on-prem environments.
•Develop reusable Terraform modules for scalable infrastructure deployments.
•Manage provisioning of cloud resources across AWS, Azure, and GCP.
•Implement Terraform state management (remote backends like S3, Azure Storage).
•Integrate Terraform with CI/CD pipelines for automated deployments.
•Ensure infrastructure versioning and change tracking using Git.
•Implement security best practices and compliance policies in Terraform code.
•Perform infrastructure cost optimization and resource lifecycle management.
•Troubleshoot and resolve infrastructure provisioning issues.
•Collaborate with DevOps and Cloud teams for automation initiatives.
•Automate infrastructure provisioning using Terraform and orchestration tools.
•Maintain proper documentation and governance of IaC standards.
Grafana Expert
Responsibilities of Role:
•Design and maintain scalable Grafana dashboards for real-time monitoring of
infrastructure , applications and business KPIs
• Integrate Grafana with a wide range of data-producing systems , including infrastructure
-
- components , application telemetry sources and cloud service monitoring endpoints to
- enable unified and real time visualization of operational metrics
Skill Requirements
- Dynatrace Expert
Expertise:
•Monitoring: Full-stack observability, real user monitoring (RUM), synthetic monitoring
•Cloud Platforms: AWS, Azure, GCP (integration with Dynatrace)
•Container Monitoring: Kubernetes, OpenShift, Docker
Terraform Expert
Expertise:
•IaCTools: Terraform (Core, Cloud), Terragrunt
•Cloud Platforms: AWS, Azure, GCP
•ConfigurationMgmt: Ansible (good to have)
Ansible Expert Expertise:
1.Ansible Tools: Ansible Core, Ansible Galaxy,AnsibleTower / AWX, AnsibleVault
2.Automation & CI/CD: Jenkins, GitLabCI, GitHub Actions, Azure DevOps
3.Configuration Management: Puppet (basic), Chef (basic), SaltStack(basic)
PowerShell Expert Expertise
•Scripting & Automation: PowerShell (advanced scripting, modules, functions)
•Windows Administration: Active Directory, Windows Server, Exchange
•Automation Frameworks: PowerShell DSC, Azure Automation
Other Requirements
PRTG Expert
Expertise:
•Monitoring Tools: PRTG Network Monitor
•Protocols: SNMP, WMI, NetFlow, sFlow, HTTP/HTTPS
Nagios Expert
Expertise
•Monitoring Tools: Nagios Core, Nagios XI
•Plugins: NRPE, NCPA, custom plugin development
Grafana Expert
Expertise
1.Visualization Tools: Grafana, Kibana
2.Metrics & Monitoring: Prometheus, InfluxDB, Graphite, Telegraf, CollectD
PRTG Expert – 12+ years of IT experience with 3+ years in PRTG
PowerShell Expert - 12+ years of IT experience with 4+ years in PowerShell scripting and automation
Ansible -12+ years experience
Nagios -12+ years of IT experience with 3+ years in Nagios
Grafana Expert -12+ years experience
Dynatrace Expert - 12+ years of overall IT experience with 4+ years in Dynatrace
Terraform -12+ years of IT experience with 4+ years in Terraform