Large Language Model (LLM) Architect
We are seeking an experienced Large Language Model (LLM) Architect to join our team. The ideal candidate will be responsible for designing, developing, and optimizing large-scale language models to enhance our AI capabilities. This role involves working closely with data scientists, engineers, and product managers to deliver state-of-the-art AI solutions
Responsibility:
- Design and optimize large language models, devise fine-tuning strategies, and streamline the training process.
- Develop systems for efficient model training and deployment, involving data preprocessing, parallel training, and resource management.
- Establish performance evaluation systems and monitor training metrics to ensure model quality and iteration efficiency.
- Lead and collaborate with cross-functional teams to apply large model technology in practical business scenarios.
Job Requirements:
- Degree or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields.
- Strong in deep learning theory and mainstream frameworks like PyTorch and TensorFlow.
- Proven experience in developing and deploying large language models, with a deep understanding of natural language processing (NLP) techniques.