Description and Requirements

Job Description

【Job Responsibilities】

LLM Deployment and Optimization: Participate in the architecture design of large language models and in establishing and optimizing their training process, including but not limited to pre-training, fine-tuning, and adaptation to specific tasks. Use and improve attention mechanisms to strengthen the model's language understanding, generation, and reasoning abilities. Explore strategies such as model parallelism and mixed-precision training to improve training efficiency and resource utilization, and ensure stable training and efficient convergence on large-scale datasets.
Model Evaluation and Optimization: Use a range of automated metrics and human evaluation methods to assess LLM performance and output quality, both quantitatively and qualitatively. Based on the results, analyze the model's strengths and weaknesses in depth and formulate targeted optimization strategies to keep improving its performance across different application scenarios.
Cross-Team Collaboration and Project Promotion: Work closely with other teams to integrate LLM technology into real products and business applications. Actively participate in the team's internal technical discussions and brainstorming sessions, and contribute suggestions and ideas on the team's technical direction.

【Job Requirements】

Educational Background: Bachelor's degree or above in computer science, artificial intelligence, machine learning, or a related field, with a solid foundation in mathematics and statistics.
Professional Skills: Proficient in Python and in at least one deep learning framework (such as TensorFlow or PyTorch), and proficient with LLM application frameworks such as LangChain and LlamaIndex. Solid theoretical knowledge of and practical experience in deep learning, with an in-depth understanding of the principles and technical details of large language models, such as the Transformer architecture, language model pre-training methods, and fine-tuning techniques. Familiar with the basic tasks and technologies of natural language processing, including but not limited to text classification, sentiment analysis, machine translation, and question answering systems. Strong algorithm design and programming skills, including the ability to optimize and debug code efficiently; a firm grasp of data structures and algorithm design; and familiarity with common machine learning algorithms and tools.
Project Experience: Able to independently design and implement knowledge base agents, with good coding standards and sound software engineering practices.
Problem-Solving Ability: Sharp technical insight and strong problem-solving skills. Able to independently analyze and solve complex technical problems that arise during model training and fine-tuning, to propose effective solutions through innovative thinking, and to confirm the feasibility and efficiency of those solutions through experimental verification and optimization.
Team Collaboration and Communication: Excellent teamwork and communication skills. Able to explain technical proposals and ideas clearly and accurately, to listen to the opinions and suggestions of others, and to work with colleagues to overcome technical difficulties. Good documentation skills, including the ability to write clear, standardized technical documents that support knowledge sharing and knowledge transfer among team members.

【Skills of Particular Note】
Candidates with experience deploying real AI products to production are preferred.
Candidates who can independently preprocess training datasets and have experience in model stitching are preferred.

【Other Requirements (e.g., Related Work Experience)】
At least 2 years of experience in AI project development.