On behalf of Pentas Vision, a subsidiary of Sony Semiconductor Solutions Corporation (SSS), we are seeking a Cloud DevOps Engineer who is highly analytical and driven to create and optimise engineering algorithm strategies aligned with the business expertise and purpose.
Pentas Vision specialises in the research and development of Artificial Intelligence (AI), in particular Computer Vision technologies. Our core expertise lies in integrating AI solutions with diverse image sensors, facilitating seamless incorporation into embedded devices such as mobile phones, IoT cameras, and other relevant devices.
What you will be doing
- Design, develop, and deploy scalable AutoML applications on Azure, leveraging services such as Azure Machine Learning, Azure Databricks, and Azure Functions.
- Collaborate with data scientists and AI engineers to integrate and optimize AutoML models and pipelines.
- Architect and manage cloud infrastructure on Azure, including AKS, virtual machines, containers, databases, and networking components.
- As a Site Reliability Engineer (SRE), ensure high availability, scalability, and security of cloud resources, adhering to best practices and organizational standards.
- Monitor and maintain the health and performance of cloud applications and services, utilizing Azure Monitor, Azure Log Analytics, and other monitoring tools.
- Troubleshoot and resolve issues related to cloud infrastructure, applications, and AutoML workflows.
- Conduct regular reviews and optimizations of cloud resource usage to manage costs and enhance performance.
- Create and maintain comprehensive documentation for cloud infrastructure, AutoML applications, and operational procedures.
- Work closely with cross-functional teams, including data science, software engineering, and IT operations, to support project goals and deliverables.
- Implement and maintain rigorous testing and quality assurance processes to ensure application reliability.
- Identify potential risks and issues, develop mitigation strategies, and proactively resolve problems that may affect project success.
What you should have
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 3+ years of experience in cloud engineering.
- Experience implementing and maintaining cloud platform systems, including Kubernetes (K8s) cluster deployment, operation, and maintenance.
- Experience with Docker and Kubernetes, including designing, developing, deploying, and managing containerized applications.
- Experience with MySQL or other relational databases, including performance tuning.
- Strong knowledge of IaC frameworks and tools, and of CI/CD pipelines using GitHub Actions, to ensure high-quality system delivery.
- Familiarity with cloud system monitoring and operations.
- Familiarity with cloud system high availability (HA), scalability, security, and optimization.
- Experience with additional programming languages such as Python, PowerShell, or Bash.
- Basic Linux and networking skills.
- Excellent networking and interpersonal skills.
- Highly organised and diligent, with great attention to detail.
- Strong problem-solving and analytical thinking skills.
- Active listener, able to interpret and play back stakeholder requirements.
- Highly driven and able to work individually and as part of a team.
What will help you succeed
- Advanced degree in Computer Science, Engineering, or a related field.
- Certifications:
  - Microsoft Certified: Azure Solutions Architect Expert
  - Microsoft Certified: Azure DevOps Engineer Expert
  - Microsoft Certified: Azure AI Engineer Associate
  - Certified Kubernetes Administrator (CKA)
- Experience with a strong focus on Azure services and AutoML applications.
- Proficiency in Redis caching and messaging middleware (Azure Service Bus or RabbitMQ).
- Proficiency with Infrastructure as Code (IaC) tools such as Terraform or ARM templates, including best practices, deployments, debugging, and troubleshooting.
- Experience with advanced Azure services and integrations.
- Experience developing Kubernetes applications and CI/CD pipelines (e.g. Argo CD, GitHub Actions, Flux).
- Experience developing asynchronous tasks on Kubernetes (e.g. KEDA, Argo Workflows).
- Proven track record of managing large-scale cloud deployments.
- Experience with additional programming languages such as Python, PowerShell, or Bash.
- Knowledge of additional AutoML and machine learning frameworks.
- Experience reducing toil as an SRE.
Benefits you will have
- Flexible work arrangements, because we understand that life happens.
- Comprehensive medical benefits that include term life insurance.
- Allowances and variable bonus.
- Special staff purchase rates.
- Milestone gifts, such as a marriage gift, to celebrate personal milestones.
- Wellness activities to promote a healthy lifestyle.
- Curated training programmes to encourage continuous professional development.
We strive to create a place for you to realise your potential and inspire you to make a positive impact through innovation, smart collaboration and boundless curiosity. We are looking for people who believe that they can enrich lives and help us achieve our purpose: to fill the world with emotion through the power of creativity and technology.