Scale-out Engineer (Remote, US or Canada) Job at Yoh, A Day & Zimmermann Company, Santa Ana, CA

K1piSnN2azhVOE1CQUxlWnNhSWJva0g1Y1E9PQ==
  • Yoh, A Day & Zimmermann Company
  • Santa Ana, CA

Job Description

Seeking a skilled AI Scale-Out Software Engineer to build and optimize our clients scale-out fabric (TT-fabric) for distributed inference and training infrastructure. The ideal candidate will have expertise in deep learning, distributed systems, and low-level networking.





Responsibilities
-Design, develop, and maintain TT-fabric, a low-level networking library for AI processors built on top of Ethernet protocol
-Design and implement efficient distributed training systems for large-scale deep learning models
-Optimize network communication for multi-node AI processor clusters
-Tune system performance for inference and training of key AI models
-Work in the Metalium team and integrate scale-out APIs into the Programming Model
-Work with AI model builder and researchers to improve both the scale out infrastructure and as well as model design







Experience & Qualifications

-Bachelor's or Master’s degree in Computer Science, Electrical Engineering, or a related field.
-Proven experience in low-level software development.
-Strong proficiency in programming languages such as C / C++.
-Experience with MPI or similar distributed computing frameworks
-Experience with low-level networking libraries (e.g., libfabric, libibverbs)
-Knowledge of networking protocols, especially Ethernet and InfiniBand
-Knowledge of high-performance interconnects
-Familiarity with RDMA programming
-Familiarity with large-scale deep learning frameworks (e.g., PyTorch, TensorFlow)
-Familiarity with network offload engines and SmartNICs
-Strong communication skills and the ability to work effectively with cross-functional teams.
-Passion for technology and a commitment to pushing the boundaries of what is possible in AI.



Estimated Min Rate : $185000.00

Estimated Max Rate : $250000.00

Note: Any pay ranges displayed are estimations. Actual pay is determined by an applicant's experience, technical expertise, and other qualifications as listed in the job description. All qualified applicants are welcome to apply.

Yoh, a Day & Zimmermann company, is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Visit   to contact us if you are an individual with a disability and require accommodation in the application process.

For California applicants, qualified applicants with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. All of the material job duties described in this posting are job duties for which a criminal history may have a direct, adverse, and negative relationship, potentially resulting in the withdrawal of a conditional offer of employment.

Job Tags

Remote job,

Similar Jobs

Numero Data

Data Analyst Job at Numero Data

 ...understanding of the appropriate chart types (Bar charts, Line charts, scatter plot, Heat Maps) to use to highlight patterns in the data. Should have knowledge of administration and installation of Tableau servers. ~ Skilled on different databases like RDBMS... 

Coda Search│Staffing

Marketing Manager Job at Coda Search│Staffing

 ...Job Summary: The Marketing Manager is responsible for planning, executing, and optimizing comprehensive marketing strategies to enhance brand awareness, generate leads, and drive business growth. This role involves managing digital and traditional marketing efforts,... 

Upward Health

Nurse Practitioner Job at Upward Health

Nurse Practitioner (NP) Upward Health is a home-based medical group specializing in primary care and behavioral health for individuals with complex needs. We serve patients throughout their communities, and we diagnose, treat, and prescribe anywhere our patients call...

Alexandra Lozano Immigration Law PLLC

Client Correspondence Processing Specialist Job at Alexandra Lozano Immigration Law PLLC

Job Summary: The Client Correspondence Receiving Specialist performs duties related to the processing of correspondence and packages received in our company. The responsibilities of this position include working collaboratively with colleagues to ensure document consistency...

Philadelphia Housing Authority

Public Safety and Crime Analyst Job at Philadelphia Housing Authority

 ...available data including crime stats from Office of Public Safety (OPS), Records Management System (RMS), Philadelphia Police Department (PPD) Notifications, Public Safety Officer Data, ShotSpotter system and data concerning shootings citywide; Synthesize information...