You will be part of a dynamic team to drive growth in designing, managing and developing a Regional Data Warehouse Platform with various local and remote teams to generate insights and reports of the OTT business. Reporting to Assistant Lead, Data Innovation, you will be involved in improving/maintaining the data platform, AI/ML innovation projects and creating insightful dashboards.
Key Responsibilities
- Build and support Google Cloud Platform CDP, AI/ML and Big Data Projects
- Explore / Reconcile data using various open-source tools and/or cloud tools to generate business insights
- Work on insightful dashboards for business and operations
- Work on PoC and AI/ML projects within the group and with 3rd parties
- Solve complex data problems to deliver insights that helps the organization's business to achieve their goals
- Design, code and test data systems and work on implementing those into the internal infrastructure or cloud platform
- Build scalable data pipelines to extract, transform, load and integrate data
- Develop codes and scripts to process structured and unstructured date in real-time form a variety of data sources
- Test data pipelines for scalability and reliability to process high data volume, variety and velocity
- Consolidate and create storage solutions for storage and retrieval of information of regional data
- Gather data and generate business insight requirements by working with local Content, Marketing and Integrated Sales teams
Requirements
- Bachelor degree or Diploma in Computer Science or Computer Studies.
- 1 - 2 years Data Analysis experience
- Experience in Go/Python programming as well as in AI/ML
- Has prior experience in Data modelling, API integrations, statistics, forecasting, DML, DDL, DQL, DCL or TCL
- Familiar with or willingness to learn Google Cloud and open to explore / PoC on other cloud platforms.
- Confidence in writing SQL or willingness to improve SQL skill
- Experience or willingness to learn modern software development lifecycle, CI/CD
- Open to learn new technologies to achieve best practices
Desired:
- Experience in creating ETL workflows, using Airflow or similar open source projects.
- Experience in real-time or near real-time data ingest at Google Cloud Platform or similar.
- Willingness to learn or working experience of one or more of the followings: Serverless Data Pipelines, Asynchronous Messaging Services, Cloud Object Storage, Cloud Scheduling Systems, Fully Managed Data Workflow Orchestration Services, Multi Cloud Data Warehouses, Serverless Workflow Orchestration and VPC Networks.
- Experience in AI/ML projects and/or research, such as BQML, Vertext AI and Auto ML.