Gaudi-tutorials

Gaudi-tutorials

Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/

Stars: 56

Visit
 screenshot

The Intel Gaudi Tutorials repository contains source files for tutorials on using PyTorch and PyTorch Lightning on the Intel Gaudi AI Processor. The tutorials cater to users from beginner to advanced levels and cover various tasks such as fine-tuning models, running inference, and setting up DeepSpeed for training large language models. Users need access to an Intel Gaudi 2 Accelerator card or node, run the Intel Gaudi PyTorch Docker image, clone the tutorial repository, install Jupyterlab, and run the Jupyterlab server to follow along with the tutorials.

README:

Intel® Gaudi® Tutorials

These are the source files for the tutorials on the Developer Website

The tutorials provide step-by-step instructions for PyTorch and PyTorch Lightning on the Intel Gaudi AI Processor, from beginner level to advanced users. These tutorials should be run with a full Intel Gaudi Node of 8 cards.

IMPORTANT: To run these Jupyter Notebooks you will need to follow these steps:

  1. Get access to an Intel Gaudi 2 Accelerator card or node. See the Get Access page on the Developer Website. Be sure to use port forwarding ssh -L 8888:localhost:8888 -L 7860:localhost:7860 -L 6006:localhost:6006 ... user@ipaddress to be able to access the notebook, run the Gradio interface, and use Tensorboard. Some of the tutorials use all of these features.
  2. Run the Intel Gaudi PyTorch Docker image. Refer to the Docker section of the Installation Guide for more information. Running the docker image will allow you access to the entire software stack without having to worry about detailed Software installation Steps.
docker run -itd --name Gaudi_Docker --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none --cap-add=sys_nice --net=host --ipc=host vault.habana.ai/gaudi-docker/1.19.0/ubuntu22.04/habanalabs/pytorch-installer-2.5.1
docker exec -it Gaudi_Docker bash
  1. Clone this tutorial in your $HOME directory: cd ~ && git clone https://www.github.com/habanaai/Gaudi-tutorials
  2. Install Jupyterlab: python3 -m pip install jupyterlab
  3. Run the Jupyterlab Server, using the same port mapping as the ssh command: python3 -m jupyterlab_server --IdentityProvider.token='' --ServerApp.password='' --allow-root --port 8888 --ServerApp.root_dir=$HOME & and take the local URL and run that in your browser

The tutorials will cover the following domains and tasks:

Advanced

Intermediate

Getting Started

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for Gaudi-tutorials

Similar Open Source Tools

For similar tasks

For similar jobs