Cookies & analytics consent
We serve candidates globally, so we only activate Google Tag Manager and other analytics after you opt in. This keeps us aligned with GDPR/UK DPA, ePrivacy, LGPD, and similar rules. Essential features still run without analytics cookies.
Read how we use data in our Privacy Policy and Terms of Service.
🤖 15+ AI Agents working for you. Find jobs, score and update resumes, cover letter, interview questions, missing keywords, and lots more.

Jobs via Dice • Texas, United States
Role & seniority
Stack/tools
Python (API development, automation, scripting)
Linux/Unix administration, shell scripting
SQL (PostgreSQL, MySQL, etc.)
Monitoring/observability: Prometheus, Grafana, ELK, Datadog, Dynatrace
CI/CD & automation: Jenkins, GitHub Actions, Terraform, Ansible
Infrastructure: cloud-native (AWS/Azure/GCP), networking fundamentals
Containerization: Docker, Kubernetes (nice-to-have)
Top 3 responsibilities
Design, automate, and optimize production systems; develop and maintain automation tools and scripts
Implement/manage monitoring, alerting, observability; perform root cause analysis and resolve incidents
Design/enhance CI/CD pipelines and infrastructure automation; ensure high availability, performance, and reliability
Must-have skills
8+ years in software development or site reliability engineering
Strong Python programming, including automation and API development
Linux/Unix administration and scripting
Proficient SQL skills; scripting for analytics and troubleshooting
Experience with monitoring/observability tools (Prometheus, Grafana, ELK, Datadog, Dynatrace)
Automation pipelines/tools (Jenkins, GitHub Actions, Terraform, Ansible)
Networking fundamentals and cloud-native environments (AWS/Azure/GCP)
Nice-to-haves
Containerization expertise (Docker, Kubernetes)
Practical experience in capacity planning and SLO/SLI/SLA definition
Locat
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Floga technologies, is seeking the following. Apply via Dice today!
Job Title: Python Developer SRE & Automation Engineer
Location: 100% Remote
Duration: Long Term
Experience Level: 8 15+ years
We are seeking a highly skilled Python Developer with strong Site Reliability Engineering (SRE) experience to design, automate, and optimize production systems. The ideal candidate will have a solid foundation in Python development, Linux administration, SQL, and modern monitoring and troubleshooting practices.
You will be responsible for improving system reliability, performance, and scalability through automation and proactive monitoring - ensuring our applications and infrastructure run efficiently at scale.
Develop and maintain automation tools and scripts using Python to streamline operational and deployment processes. Implement and manage monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK, Datadog, or Dynatrace). Collaborate with development, DevOps, and infrastructure teams to ensure high availability, performance, and reliability of applications. Perform root cause analysis and resolve complex production incidents, minimizing downtime and service impact. Design and enhance CI/CD pipelines and infrastructure automation using tools like Jenkins, Ansible, or Terraform. Write and optimize SQL queries for performance monitoring, analytics, and troubleshooting. Manage and monitor systems on Linux environments, handling configuration, tuning, and process automation. Define SLOs, SLIs, and SLAs to ensure measurable service reliability. Continuously improve system resilience and fault tolerance through testing and capacity planning.
8+ years of total experience in software development or system reliability engineering. Strong programming expertise in Python, including API development, automation, and scripting. Solid knowledge of Linux/Unix system administration and shell scripting. Strong understanding of SQL (PostgreSQL, MySQL, or similar). Proven experience in troubleshooting complex distributed systems. Hands-on experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack, Datadog, Dynatrace, etc.). Experience building or maintaining automation pipelines (Jenkins, GitHub Actions, Terraform, Ansible, etc.). Knowledge of networking fundamentals, load balancing, and cloud-native environments (AWS/Azure/Google Cloud Platform). Familiarity with containerization (Docker, Kubernetes) is a plus.