model-partitioning

Here are 4 public repositories matching this topic...

Konstantina155 / InferONNX

Lightweight TEE-based system for secure ONNX model inference using Intel SGX with automated model partitioning to fit within enclave memory constraints.

machine-learning inference intel-sgx onnx trusted-execution-environment model-partitioning

Updated Oct 16, 2025
C

vel5id / 3DPrint-Full-Pipeline_Blackwell

Hunyuan3D-2 fork — image→textured 3D→sliced STL + part segmentation. RTX 50-series (Blackwell/sm_120), CUDA 13.0, Python 3.12, PyTorch 2.11+cu130.

pytorch 3d-printing 3d-generation blackwell model-generation python-3-12 hunyuan3d rtx-50-series model-partitioning cuda-13

Updated Jun 8, 2026
Python

kitzbergerg / TUW-master-thesis

Web-Based Distributed LLM Inference

distributed-systems web-assembly webgpu onnx-runtime llm-inference model-partitioning

Updated May 11, 2026
Rust

mihaid150 / distributed-layer-inference

Distributed layer inference for transformer LLMs on edge K3s clusters, with Python/PyTorch and native C++/llama.cpp runtimes, GGUF stage shards, Kubernetes manifests, and an Ops UI for monitoring experiments.

python kubernetes raspberry-pi cpp pytorch edge-computing fastapi edge-ai k3s llama-cpp llm-inference gguf distributed-inference model-partitioning

Updated May 26, 2026
C++
