amd-shark-ai

AMD-SHARK Inference Modeling and Serving

Stars: 62

The amdshark-ai repository contains the amdshark Modeling and Serving Libraries, which include sub-projects such as shortfin for high-performance inference, amdsharktank for model recipes and conversion tools, and amdsharktuner for tuning program performance. The repository also provides API documentation, programming guides, and a support matrix for various models.

README:

amdshark-ai: amdshark Modeling and Serving Libraries

amdshark Users

If you're looking to use amdshark, check out our User Guide. For developers, continue reading.

Sub-projects

The shortfin sub-project is amdshark's high-performance inference library and serving engine.

  • API documentation for shortfin is available on Read the Docs.

The amdshark Tank sub-project contains a collection of model recipes and conversion tools to produce inference-optimized programs.

The amdshark Tuner sub-project assists with tuning program performance by searching for optimal parameter configurations to use during model compilation. Check out the readme for more details.
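To make "searching for optimal parameter configurations" concrete, here is a minimal sketch of an exhaustive (grid) search over a small configuration space. It does not use the amdsharktuner API. The parameter names (`tile_size`, `unroll`) and the `measure_latency` cost function are hypothetical stand-ins for compiling and benchmarking a candidate configuration.

```python
import itertools

# Hypothetical search space; a real tuner explores compiler-specific parameters.
SEARCH_SPACE = {
    "tile_size": [16, 32, 64],
    "unroll": [1, 2, 4],
}


def measure_latency(config):
    """Made-up cost model; a real tuner would compile and benchmark instead."""
    return abs(config["tile_size"] - 32) + 3 * abs(config["unroll"] - 2) + 10


def grid_search(space, cost_fn):
    """Evaluate every combination and return the cheapest (config, cost) pair."""
    names = list(space)
    best_config, best_cost = None, float("inf")
    for values in itertools.product(*(space[name] for name in names)):
        config = dict(zip(names, values))
        cost = cost_fn(config)
        if cost < best_cost:
            best_config, best_cost = config, cost
    return best_config, best_cost


best_config, best_cost = grid_search(SEARCH_SPACE, measure_latency)
print(best_config, best_cost)
```

Exhaustive search is only practical for small spaces like this one. Larger spaces generally call for pruning or sampling, and which strategy amdsharktuner uses is not stated here, so consult its readme.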

Support matrix

Models

| Model name | Model recipes | Serving apps | Guide |
| --- | --- | --- | --- |
| SDXL | amdsharktank/amdsharktank/models/punet/ | shortfin/python/shortfin_apps/sd/ | shortfin/python/shortfin_apps/sd/README.md |
| llama | amdsharktank/amdsharktank/models/llama/ | shortfin/python/shortfin_apps/llm/ | docs/shortfin/llm/user/llama_serving.md |
| Flux | amdsharktank/amdsharktank/models/flux/ | shortfin/python/shortfin_apps/flux/ | shortfin/python/shortfin_apps/flux/README.md |
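For scripting against the repository layout, the support matrix above can be expressed as a small lookup table. This is an illustrative sketch only: the paths are copied from the matrix, while the dictionary layout, key names, and the `lookup` helper are assumptions and are not part of any amdshark package.

```python
# Paths copied from the support matrix; the structure and key names are illustrative.
SUPPORT_MATRIX = {
    "SDXL": {
        "recipes": "amdsharktank/amdsharktank/models/punet/",
        "serving_app": "shortfin/python/shortfin_apps/sd/",
        "guide": "shortfin/python/shortfin_apps/sd/README.md",
    },
    "llama": {
        "recipes": "amdsharktank/amdsharktank/models/llama/",
        "serving_app": "shortfin/python/shortfin_apps/llm/",
        "guide": "docs/shortfin/llm/user/llama_serving.md",
    },
    "Flux": {
        "recipes": "amdsharktank/amdsharktank/models/flux/",
        "serving_app": "shortfin/python/shortfin_apps/flux/",
        "guide": "shortfin/python/shortfin_apps/flux/README.md",
    },
}


def lookup(model_name):
    """Case-insensitive lookup (hypothetical helper); raises KeyError if absent."""
    for name, entry in SUPPORT_MATRIX.items():
        if name.lower() == model_name.lower():
            return entry
    raise KeyError(f"model not in support matrix: {model_name}")


print(lookup("llama")["guide"])
```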

amdshark Developers

If you're looking to develop amdshark, check out our Developer Guide.
