Role & seniority

Python Developer SRE & Automation Engineer (Senior), 8+ years experience

Stack/tools

Python (API development, automation, scripting)
Linux/Unix administration, shell scripting
SQL (PostgreSQL, MySQL, etc.)
Monitoring/observability: Prometheus, Grafana, ELK, Datadog, Dynatrace
CI/CD & automation: Jenkins, GitHub Actions, Terraform, Ansible
Infrastructure: cloud-native (AWS/Azure/GCP), networking fundamentals
Containerization: Docker, Kubernetes (nice-to-have)

Top 3 responsibilities

Design, automate, and optimize production systems; develop and maintain automation tools and scripts
Implement/manage monitoring, alerting, observability; perform root cause analysis and resolve incidents
Design/enhance CI/CD pipelines and infrastructure automation; ensure high availability, performance, and reliability

Must-have skills

8+ years in software development or site reliability engineering
Strong Python programming, including automation and API development
Linux/Unix administration and scripting
Proficient SQL skills; scripting for analytics and troubleshooting
Experience with monitoring/observability tools (Prometheus, Grafana, ELK, Datadog, Dynatrace)
Automation pipelines/tools (Jenkins, GitHub Actions, Terraform, Ansible)
Networking fundamentals and cloud-native environments (AWS/Azure/GCP)

Nice-to-haves

Containerization expertise (Docker, Kubernetes)
Practical experience in capacity planning and SLO/SLI/SLA definition

Locat

Full Description

Dice is the leading career destination for tech experts at every stage of their careers. Our client, Floga technologies, is seeking the following. Apply via Dice today!

Job Title: Python Developer SRE & Automation Engineer

Location: 100% Remote

Duration: Long Term

Experience Level: 8 15+ years

Position Summary

We are seeking a highly skilled Python Developer with strong Site Reliability Engineering (SRE) experience to design, automate, and optimize production systems. The ideal candidate will have a solid foundation in Python development, Linux administration, SQL, and modern monitoring and troubleshooting practices.

You will be responsible for improving system reliability, performance, and scalability through automation and proactive monitoring - ensuring our applications and infrastructure run efficiently at scale.

Key Responsibilities

Develop and maintain automation tools and scripts using Python to streamline operational and deployment processes. Implement and manage monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK, Datadog, or Dynatrace). Collaborate with development, DevOps, and infrastructure teams to ensure high availability, performance, and reliability of applications. Perform root cause analysis and resolve complex production incidents, minimizing downtime and service impact. Design and enhance CI/CD pipelines and infrastructure automation using tools like Jenkins, Ansible, or Terraform. Write and optimize SQL queries for performance monitoring, analytics, and troubleshooting. Manage and monitor systems on Linux environments, handling configuration, tuning, and process automation. Define SLOs, SLIs, and SLAs to ensure measurable service reliability. Continuously improve system resilience and fault tolerance through testing and capacity planning.

Required Skills & Experience

8+ years of total experience in software development or system reliability engineering. Strong programming expertise in Python, including API development, automation, and scripting. Solid knowledge of Linux/Unix system administration and shell scripting. Strong understanding of SQL (PostgreSQL, MySQL, or similar). Proven experience in troubleshooting complex distributed systems. Hands-on experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack, Datadog, Dynatrace, etc.). Experience building or maintaining automation pipelines (Jenkins, GitHub Actions, Terraform, Ansible, etc.). Knowledge of networking fundamentals, load balancing, and cloud-native environments (AWS/Azure/Google Cloud Platform). Familiarity with containerization (Docker, Kubernetes) is a plus.

FLG -55 - Python Developer SRE & Automation Engineer - 100% Remote

Full Description

Position Summary

Key Responsibilities

Required Skills & Experience