PolyU Academy for Artificial Intelligence
Principal Engineering Manager
(Ref. 250515006-IE)
Duties
The appointee will be required to work for one of the constituent research units (to be established) under the PolyU Academy for Artificial Intelligence (PAAI). The appointee will be required to:
(a) lead and organise Large Language Model (LLM) training, encompassing the complete cycle of preparation, training and validating of language models;
(b) be responsible for the LLM architecture design, development and optimisation of Mixture of Experts (MOE) and Dense models;
(c) organise and build scalable pipelines for large-scale data spider and processing, including web data, books, papers and code, etc.;
(d) lead and conduct the capability assessment and performance analysis of LLM across various complex tasks, including reasoning abilities, knowledge depth, creativity and safety guardrails;
(e) explore the upper boundaries of LLM capabilities by developing advanced techniques for data learning efficiency and designing innovative model architectures;
(f) organise and implement strategic actions, regulations and procedures of the assigned technical team;
(g) facilitate effective resources deployment and performance evaluation of the assigned technical team;
(h) collaborate closely with internal stakeholders of the University and outside parties, including industrial partners and clients; and
(i) perform any other duties as assigned by the Director of PAAI or his/her delegates or the Senior Management of the University from time to time.
Qualifications
Applicants should have:
(a) a recognised bachelor's degree and master’s degree in Computer Science, Data Science, Business Analytics or a related discipline obtained from a university ranked top 150 by the QS/THE/ARWU Rankings;
(b) at least ten years of extensive relevant experience at managerial/specialist level in the field of AI in sizable organisations, including a minimum of three years of managerial/specialist experience in LLM R&D;
(c) successfully built AI applications using proprietary self-trained LLMs that have achieved over 10 million daily active users (e.g. ChatGPT/Claude/Doubao/Deepseek); and
(d) solid experience in language model development, with at least three complete training cycles of 50B+ parameter models on datasets exceeding 10 trillion tokens.
Preference will be given to those who have published in CCF-A journal and/or conference papers, and obtained ACM awards or equivalent.
Applicants with less experience may be considered for the post of Senior Engineering Manager.
Conditions of Service
A highly competitive remuneration package will be offered. Initial appointment will be on a fixed-term gratuity-bearing contract. Re-engagement thereafter is subject to mutual agreement.
Consideration of applications will commence on 26 May 2025 until the position is filled.
Posting date: 15 May 2025