Working day: Monday to Friday, 9am to 6pm
Working location: Tanjong Pagar
Job Responsibilities:
- Responsible for software installation, deployment, configuration, monitoring, optimization, and maintenance in server-related applications.
- Handle the daily operation and maintenance of basic services of various systems.
- Perform configuration updates and emergency response to failures to ensure high availability of the business.
- Manage company project upgrades, requirements, and resource implementation.
- Support development requirements, testing, and adjustments on the operation and maintenance platform.
- Plan, deploy, implement, and monitor system operations and maintenance tasks.
- Conduct data backup, disaster recovery, data transfer, log analysis, and troubleshooting.
- Improve system architecture and maintain related operation and maintenance documentation.
- Design and implement automated tools related to operation and maintenance.
- Develop systems such as build systems, continuous integration systems, automated testing platforms, and code quality management systems.
- Maintain the software development pipeline automation to enable Continuous Integration (CI) and Continuous Deployment (CD).
- Implement infrastructure and operating environments for container-based microservices, ensuring high availability, performance, and security.
- Any other ad-hoc duties assigned.
Job requirements:
- At least a Diploma or Bachelor's degree in Computer Science, Information Technology, or a related field.
- Extensive experience with Cloud hosts, such as AWS or Alibaba Cloud.
- Proficiency in Linux environments including deployment, configuration, and tuning.
- Strong scripting skills (Shell/Python) for operation and maintenance automation.
- Expertise in CI/CD tools and practices, and configuration management tools such as Ansible and Jenkins.
- Familiarity with monitoring tools such as Zabbix and ELK.
- Experience with Kubernetes and Docker, including installation, deployment, configuration, and optimization.
- Company assets, UPS management, and server room maintenance.
- Management of servers, SVN, network, platform, DNS, VPN, and mail services.
- Setup and maintenance of monitoring systems.
- Management of frontend and backend DevOps platforms.
- Setup new laptops for new employees.
- Erase and repurpose laptops for departing employees.
- Troubleshoot and fix laptop issues.