Ready to apply your HPC and AI skills in a premier research institution? The Analytics and AI methods at Scale (AAIMS) group in the National Center for Computational Science (NCCS) is seeking hardworking research scientists in the very broad areas of intersection of High-Performance Computing (HPC) and AI.
Examples on areas of research interest include:
NLP and Foundational model pre-training, human-in-loop reinforcement learning and fine-tuning.
Performance analysis and optimization of large-scale AI/ML stack and applications including computational optimization of mixed-precision kernels, data parallelism, model-parallelism, collective communication patterns and strategies for large-scale, distributed machine learning using frameworks such as PyTorch and TensorFlow.
Operational data (Power, energy, CPU/GPU utilization, job scheduling, large scale storage and I/O traces, system logs) analytics to enable data-driven intelligence and facility innovation via novel applications of ML/AI techniques such as large-scale language model and causality analysis.
AI algorithms for handling sensitive data using methods such as Federated Learning, Differential Privacy, and Secure Multiparty Computing, among others.
Data analytics pipelines from the Edge to Exascale.
Visualization of large-scale HPC/AI discovery campaigns.
At the NCCS and its Leadership Computing Program (OLCF,) we provide world class computing facilities to applications across all computational domains and subject areas. Through this position you are able to impact and further the mission of NCCS and Oak Ridge National Laboratory. We are a diverse, dynamic environment that welcomes those with initiative and creativity.
Major Duties and Responsibilities:
Research, develop and deploy new groundbreaking tools and frameworks of scalable ML/DL on leadership computing platforms.
In collaboration with domain scientists, identify and extract ML/DL workflow, and patterns, and drive the design, development, and deployment of new scalable ML-as-service solutions.
Engage with ML academic communities to stay on top of the latest advancements in ML/DL.
Author peer reviewed papers, technical papers, reports, and proposals.
Ph.D. in Computer Science, Computer Engineering, or a field closely related to the job duties of this position.
3+ years of relevant research experience outside of Ph.D.
Experience leading major projects in laboratory or academic settings.
Strong publication record and record of developing high-impact capabilities and tools.
ORNL Ethics and Conduct:
As a member of the ORNL scientific community, you will be expected to commit to ORNL’s Research Code of Conduct. A description and a statement by the Lab Director’s office can be found here: https://www.ornl.gov/content/research-integrity
Benefits at ORNL:
ORNL offers competitive pay and benefits programs to attract and retain hardworking people. The laboratory offers many employee benefits, including medical and retirement plans and flexible work hours, to help you and your family live happy and healthy. Employee amenities such as on-site fitness, banking, and cafeteria facilities are also provided for convenience.
Other benefits include the following: Prescription Drug Plan, Dental Plan, Vision Plan, 401(k) Retirement Plan, Contributory Pension Plan, Life Insurance, Disability Benefits, Generous Vacation and Holidays, Parental Leave, Legal Insurance with Identity Theft Protection, Employee Assistance Plan, Flexible Spending Accounts, Health Savings Accounts, Wellness Programs, Educational Assistance, Relocation Assistance, and Employee Discounts.
In addition, we offer a flexible work environment that supports both the organization and the employee. A hybrid/onsite working arrangement may be available with this position.
If you have difficulty using the online application system or need an accommodation to apply due to a disability, please email: ORNLRecruiting@ornl.gov or call 1.866.963.9545.