janhq/cortex.tensorrt-llm
Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a git submodule for accelerated inference on NVIDIA GPUs.
Stars: 42
Forks: 3
Open issues: 3
Watchers: 42
Size: 279.2 MB
Language: C++
License: Apache License 2.0
Topics: jan, llm, nvidia, tensorrt, tensorrt-llm
Created: Mar 4, 2024
Updated: Aug 29, 2025
Last push: Sep 26, 2024
Status: Archived