Machine Learning Computer Architect, Senior Staff - Workload Analysis

Remote Full-time
At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration. We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals who are passionate about tackling challenges and driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI.

Location: Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week.

The role: Machine Learning Computer Architect - Workload Analysis

d-Matrix is seeking outstanding computer architects to help accelerate AI application performance at the intersection of hardware and software, with particular focus on emerging hardware technologies (such as DIMC, D2D, PIM, etc.) and emerging workloads (such as generative inference). Our acceleration philosophy cuts across the system, ranging from efficient tensor cores, storage, and data movement to co-design of dataflow and collective communication techniques.

What you will do:
• As a member of the architecture team, you will analyze the latest ML workloads (multi-modal LLMs, CoT reasoning models, video/audio generation).
• You will contribute hardware and software features that power the next generation of inference accelerators in datacenters.
• This role requires keeping up with the latest research in ML architecture and algorithms, and collaborating with partner teams including Product, Hardware Design, Compiler, Inference Server, and Kernels.
• Your day-to-day work will include (1) analyzing the properties of emerging machine learning algorithms and workloads and identifying their functional and performance implications, (2) creating analytical models to project performance on current and future generations of d-Matrix hardware, and (3) proposing new HW/SW features to enable or accelerate these algorithms.

What you will bring:
Minimum:
• MSEE with 7+ years of experience or PhD with 5+ years of applicable experience.
• Solid grasp, through academic or industry experience, of several of the relevant areas: computer architecture, hardware-software co-design, performance modeling, ML fundamentals (particularly DNNs).
• Programming fluency in C/C++ or Python.
• Experience developing analytical performance models or architecture simulators for performance analysis, or hacking on existing ones such as cycle-level simulators (gem5, GPGPU-Sim, etc.).
• Research background with a publication record in top-tier architecture or machine learning venues (such as ISCA, MICRO, ASPLOS, HPCA, DAC, MLSys, etc.) is a huge plus.
• Self-motivated team player with a strong sense of collaboration and initiative.

Equal Opportunity Employment Policy

d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.

d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individuals interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of all applicants. Thank you for your understanding and cooperation.