Job Description
Having witnessed the achievements of the large models in 2023, there has been a new surge of machine learning and AI models in recent days, which again puts up the public concerns about the explosively increasing energy consumption during the model training phase. According to Dr. Sajjad Moazeni, an AI researcher at University of Washington, the training of GPT-3 roughly consumes 10 gigawatt-hour of electricity ^([1]), approximately equivalent to the daily electricity consumption of 440,000 average Singapore households in 2021 ^([2]).
Significant efforts have been devoted to improving the efficiency of the model training phase, in terms of hardware, model architecture, networking, and algorithm design, etc. The PI (Dr. Zhang Junyu) of this project would like to tackle this problem from the algorithmic perspective, by developing theoretical understanding of the open questions on the training algorithms, in terms of the convergence, complexity, local landscape properties, and potential algorithmic improvement based on the new theoretical findings.
[1] Data from UW News. [2] Data from Statista
Qualifications
Interested applicants are required to possess a PhD in 2024. He/she should have a good understanding in
1. convergence and complexity analysis for (nonconvex) optimization algorithms
2. stochastic process and martingale theory
3. semi-algebraic and subanalytic geometry
4. stochastic approximation methods
5. dynamical systems
The applicant should also be experienced in MATLAB and Python coding. In particular, he/she should have the ability to adapt base codes of PyTorch to implement new algorithms instead of calling built-in functions.
|
More Information
Location: Kent Ridge Campus
Organization: College of Design and Engineering
Department : Industrial Systems Engineering and Management
Employee Referral Eligible: No
Job requisition ID : 23720
Similar Positions
-
Research Fellow (Wireless Communication And Ai), Nanyang Technological University, Singapore, about 8 hours ago
We are looking for a research fellow to work in our project on Channel modelling and signal processing for terahertz link. The project aims to develop channel models and efficient signal processin...
-
Research Assistant (Computer Science), Nanyang Technological University, Singapore, 1 day ago
The School of Computer Science and Engineering (SCSE) invites applications for the position of Research Assistant. Key Responsibilities: Develop new paradigms for intelligent human-computer intera...
-
Research Fellow (Computer Science), Nanyang Technological University, Singapore, 1 day ago
The School of Computer Science and Engineering (SCSE) invites applications for the position of Research Fellow. Key Responsibilities: Develop new paradigms for intelligent human-computer interact...
-
Research Assistant (Image Analysis), Nanyang Technological University, Singapore, 1 day ago
The School of Electrical and Electronic Engineering is one of the founding Schools of the Nanyang Technological University. Built on a culture of excellence, the School is renowned for its high ac...
-
Research Fellow (Computer Science), Nanyang Technological University, Singapore, about 7 hours ago
The School of Computer Science and Engineering (SCSE) invites applications for the position of Research Fellow. The Research Fellow will be responsible for the development of a software platform t...
-
Research Assistant (Statistical Physics), Nanyang Technological University, Singapore, 8 days ago
Division of Physics and Applied Physics (PAP) in School of Physical and Mathematical Sciences (SPMS) is looking for a candidate to join them as a Research Assistant. The College of Science seeks a...