Responsibilities
Participate in the system solution design of next generation multi-core multi-chip AI sub-system that scales beyond 1000s TOPS for high performance computing.
Perform system validation, use case validation, performance analysis, benchmarking, modeling to identify performance bottlenecks, and system parameters optimization.
Deliver optimized solutions that meet the demanding requirements of HPC workloads and articulate them effectively across stakeholders ranging from toolchain, software and hardware teams, domain experts, and to technology leadership.
Collaborate with hardware teams to design and optimize the system's computational components, including processors, accelerators, interconnects, and memory subsystems.
Collaborate with software developers to define and implement software frameworks, libraries, and tools that maximize performance and productivity on the target HPC architecture.
Collaborate with domain experts and application developers to understand the unique requirements of specific workloads and propose tailored architectural solutions.