Company Description
Our client is a permissionless, non-custodial derivatives Decentralised Exchange (DEX) offering high performance perpetual contracts on an order book model.
Key Responsibilities
- Design, develop, and maintain tools to automate operational tasks, streamline workflows, and boost engineering productivity.
- Implement and optimize continuous integration/continuous delivery (CI/CD) pipelines, ensuring rapid, secure, and scalable deployments.
- Develop and manage configuration management and infrastructure-as-code solutions for internal deployments, ensuring consistent and repeatable processes.
- Proactively troubleshoot, diagnose, and resolve software and infrastructure issues to maintain system performance and reliability.
- Track, manage, and resolve technical issues, ensuring minimal downtime and quick resolution times.
- Collaborate with engineering teams to suggest architectural and process improvements to optimize system performance and operational efficiency.
- Stay informed about the latest technologies and evaluate vendor products to improve system operations and infrastructure scalability.
- Ensure system and infrastructure security by implementing best-in-class security practices and monitoring solutions.
Key Requirements
- 7+ years of technical experience in a DevOps or similar role, supporting large-scale production systems in distributed environments.
- Expertise in managing and optimizing large-scale distributed systems.
- In-depth knowledge and hands-on experience with AWS cloud services and cloud architecture.
- Proficiency in programming languages such as Python, Go, or similar for scripting and automation purposes.
- Strong hands-on experience with tools like Docker, Terraform, Kubernetes, ELK Stack, GitHub Actions, and other DevOps tools.
- Solid understanding and experience in working with Linux-based environments.
- Self-motivated and capable of working independently in a fast-paced, dynamic environment.
- Proven experience with implementing security best practices, including vulnerability assessments, system hardening, and monitoring.
- Strong problem-solving skills, excellent communication abilities, and a collaborative mindset.
Good to haves
- Experience with multi-cloud infrastructure management.
- Familiarity with monitoring and logging solutions like Prometheus and Grafana.
- Knowledge of database management (e.g., MySQL, PostgreSQL) and caching systems (e.g., Redis, Memcached).
- Experience with disaster recovery and high-availability system setups.
We regret to inform you that only shortlisted candidates will be contacted.
More job openings on https://t.me/rockrosejobs