TensorCraft-HPC

TensorCraft-HPC is a modern C++/CUDA AI kernel library for studying and validating GEMM, attention, convolution, normalization, sparse operators, and quantization.

Repository Overview

Header-first kernel library under include/tensorcraft/
Python bindings in src/python_ops/
Tests in tests/
Benchmarks in benchmarks/
Project docs on GitHub Pages

Quick Start

Recommended on a CUDA development machine:

cmake --preset dev
cmake --build --preset dev --parallel 2
ctest --preset dev --output-on-failure
python -m pip install -e .
python -c "import tensorcraft_ops as tc; print(tc.__version__)"

Build Presets

dev: recommended day-to-day CUDA development preset; single architecture, tests on, Python on
python-dev: lighter CUDA preset focused on building tensorcraft_ops
release: heavier full build, including benchmarks
cpu-smoke: CPU-only configure/install smoke validation; tests and Python bindings are disabled
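
As a rough illustration of how one might choose between these presets, the sketch below picks dev when an nvcc executable exists at the path this repository pins (/usr/local/cuda/bin/nvcc) and falls back to cpu-smoke otherwise. The helper names choose_preset and configure_command are hypothetical and are not part of the repository; only the preset names and the nvcc path come from this README.

```python
import os

# Hypothetical helpers for illustration; not part of TensorCraft-HPC.
DEFAULT_NVCC = "/usr/local/cuda/bin/nvcc"  # path named in this README


def choose_preset(nvcc_path=DEFAULT_NVCC):
    """Return 'dev' if nvcc_path is an executable file, else 'cpu-smoke'."""
    if os.path.isfile(nvcc_path) and os.access(nvcc_path, os.X_OK):
        return "dev"
    return "cpu-smoke"


def configure_command(preset):
    """Build the argument list for the CMake configure step."""
    return ["cmake", "--preset", preset]


if __name__ == "__main__":
    # Print the configure command instead of running it.
    print(" ".join(configure_command(choose_preset())))
```

Printing the command rather than executing it keeps the sketch safe to run on machines without CMake or CUDA installed.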

Build Notes

This repository targets the local CUDA 12.8 toolkit at /usr/local/cuda/bin/nvcc
CMake presets and Python builds pin CMAKE_CUDA_COMPILER=/usr/local/cuda/bin/nvcc
If CUDA is unavailable, CMake disables tests, benchmarks, and Python bindings automatically
If the build strains the machine's memory or CPU, prefer the dev or python-dev preset, keep --parallel low, and set a single CMAKE_CUDA_ARCHITECTURES value that matches your GPU
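
The single-architecture advice above can be sketched as a small conversion from a GPU compute capability string (for example "8.6") to the value CMake expects ("86"). The function names below are hypothetical. The sketch assumes you already know your GPU's compute capability (recent NVIDIA drivers can report it via nvidia-smi --query-gpu=compute_cap --format=csv,noheader) and that the presets do not prevent overriding CMAKE_CUDA_ARCHITECTURES with -D on the command line.

```python
# Hypothetical helpers for illustration; not part of TensorCraft-HPC.

def cuda_arch_from_compute_cap(compute_cap):
    """Convert a compute capability such as '8.6' to a CMake arch value such as '86'."""
    major, minor = compute_cap.strip().split(".")
    return f"{int(major)}{int(minor)}"


def cmake_arch_flag(compute_cap):
    """Build a -D flag that pins a single CUDA architecture."""
    return f"-DCMAKE_CUDA_ARCHITECTURES={cuda_arch_from_compute_cap(compute_cap)}"


if __name__ == "__main__":
    # Example input; replace with the compute capability of your own GPU.
    print(cmake_arch_flag("8.6"))
```

The resulting flag could then be appended to the configure step, for example cmake --preset dev followed by the printed -D flag.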

Python Bindings

The pybind11 module is exposed as tensorcraft_ops.

python -m pip install -e .
python -c "import tensorcraft_ops as tc; print(tc.__version__)"
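
For scripts that should fail gracefully when the bindings are not installed, a minimal sketch of the same check follows. The function name module_version is hypothetical; the only things taken from this README are the module name tensorcraft_ops and its __version__ attribute.

```python
import importlib
import importlib.util


def module_version(name="tensorcraft_ops"):
    """Return the module's __version__, or None if the module is not
    installed or does not define __version__."""
    # find_spec returns None for a top-level module that cannot be located.
    if importlib.util.find_spec(name) is None:
        return None
    module = importlib.import_module(name)
    return getattr(module, "__version__", None)


if __name__ == "__main__":
    version = module_version()
    if version is None:
        print("tensorcraft_ops is not available; run: python -m pip install -e .")
    else:
        print(f"tensorcraft_ops {version}")
```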

Docs

Project docs: https://lessup.github.io/modern-ai-kernels/
Installation: docs/INSTALL.md
Troubleshooting: docs/TROUBLESHOOTING.md
Contribution workflow: CONTRIBUTING.md

License

MIT License
