AI21 Labs - Deep Learning Engineer

Tel Aviv Area
Tel Aviv-Yafo
Full-time
Senior
On-site
Deep Learning, Machine Learning, TensorFlow, PyTorch, Hugging Face, Python, Production Systems, Cloud Computing (AWS, GCP, Docker, Kubernetes), Distributed Systems, Custom Kernel (C++/CUDA, Triton), Optimization, Large Language Models (LLMs), AI

Advance your **Artificial Intelligence career in Israel** as a Deep Learning Engineer at AI21 Labs, a pioneering company in Foundation Models and AI Systems. We are seeking a highly experienced professional to maintain and improve our cutting-edge training infrastructure, develop, scale, and test new ideas, and optimize code for the newest hardware accelerators. In this **Israeli tech AI job**, you will contribute to the development of multi-billion parameter Large Language Models (LLMs) and ensure they are served efficiently. If you’re looking for impactful **AI jobs in Israel**, this role offers the chance to work on advanced engineering for large-scale distributed training on thousands of cores.

**Role and Responsibilities for this Deep Learning Engineer position:**
* Develop Large Language Models as part of applied research projects, supporting the AI21 Platform, including designing, implementing, and training massive-scale deep language models.
* Implement, optimize, scale, and test new cutting-edge ideas and architectures.
* Perform large-scale evaluations and comparisons of trained models across a range of benchmarks, and add support for new benchmarks.
* Maintain and improve our training infrastructure.
* Adapt code to run on and best utilize the newest and most advanced hardware accelerators.

**Requirements for this leading AI job in Tel Aviv:**
* B.Sc. in Computer Science, Software Engineering, or equivalent.
* Self-learner with a proven record of removing technical roadblocks.
* 5+ years of experience developing software for production systems and/or internal infrastructure/tools.
* Prior experience working with cloud computing platforms and container tooling (e.g., AWS, GCP, Docker, Kubernetes).
* Skilled at writing production-grade Python code.
* Hands-on experience in deep learning and machine learning (TensorFlow/PyTorch, etc.).
* Experience in any of the following: optimization of deep learning model training (e.g., parallelization, Megatron, DeepSpeed, FSDP); custom kernel development (C++/CUDA and/or Triton); or distributed systems, particularly distributed deep learning training/serving.
