Company
Barcelona Supercomputing Center - Centro Nacional de Supercomputación
Company Website
Where
Info
Full Time
Closes: 17 July 2024
Applications have closed
Research Engineer – HPC memory systems (RE1)

Context And Mission
The Memory Systems group in the Computer Science Department at the Barcelona Supercomputing Center is offering a full-time research engineer position for a project that explores advanced memory systems for AI applications.

The performance of AI models is typically limited by the memory wall. In this project, we aim to (re)move this wall by designing memory systems tailored to the requirements of AI models. We will explore systems based on HBM, DDR, GDDR devices, and Compute Express Link (CXL) memory expanders. Additionally, we will consider near-data processing architectures that offload part of the CPU/GPU processing to the memory system.

The project is conducted in close collaboration with a major memory manufacturer. Most of the development will take place on high-end hardware products and industrial prototypes.

We strongly encourage applications from candidates interested in joining a research team with over a decade of experience in industrial projects with major hardware companies from the US, China, and Korea.

Key Duties
– Profiling and performance modeling of HPC applications running on high-end servers with advanced memory systems: HBM, DDR4/5, CXL memory expanders.
– Exploration of Processing in Memory for HPC and AI applications.
– Prototyping of industrial products and tools.
– Close interaction and reporting to industrial partners.
– Benchmarking and performance analysis of high-end hardware products and industrial prototypes.

Requirements

Education

– BSc, MSc or PhD in Computer Science

Essential Knowledge and Professional Experience

– Good knowledge of computer architecture
– Experience with Unix/Linux environments
– Experience with profiling tools: Hardware counters, PAPI interface
– Programming languages (C/C++, Python)

Additional Knowledge and Professional Experience

– Understanding of memory systems and memory technologies (e.g. DRAM, HBM) is a plus
– Experience with hardware/system simulators is a plus
– Fluency in English is essential

Competences

– Good written and verbal communication skills
– Ability to take initiative

Conditions
– The position will be located at BSC within the Computer Sciences Department
– We offer a full-time contract (37.5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance, support to the relocation procedures
– Duration: Open-ended contract due to technical and scientific activities linked to the project and budget duration
– Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement
– Salary: we offer a competitive salary commensurate with the qualifications and experience of the candidate and according to the cost of living in Barcelona
– Starting date: Q2-Q3 2024