choi95
  • United Kingdom

Starred repositories

C++ 12 Updated Jul 11, 2023

A Heterogeneous Platform Deep Learning Compiler Framework from EdgeCortix

Python 30 3 Updated Aug 2, 2024

Automatic Schedule Exploration and Optimization Framework for Tensor Computations

Python 176 30 Updated Apr 25, 2022

Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into efficient execution plans.

Cuda 60 7 Updated Oct 3, 2022

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

C++ 1,237 186 Updated Apr 14, 2024

C++ 13 4 Updated May 19, 2023

Puzzles for learning Triton

Jupyter Notebook 1,015 65 Updated Sep 25, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 30,101 2,756 Updated Oct 4, 2024

XLS: Accelerated HW Synthesis

C++ 1,195 174 Updated Oct 4, 2024

📙 Source code for "BenchPress: A Deep Active Benchmark Generator", PACT 2022

Python 21 3 Updated Mar 15, 2023

Snapdragon Neural Processing Engine (SNPE) SDK. The Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerated runtime for the execution of deep neural networks. With SN…

Python 33 8 Updated Apr 8, 2022

Buda Compiler Backend for Tenstorrent devices

C++ 22 4 Updated Sep 24, 2024

Tenstorrent TT-BUDA Repository

Python 208 28 Updated Sep 24, 2024

The Triton backend for TensorRT.

C++ 60 28 Updated Sep 11, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,134 1,459 Updated Oct 4, 2024

CMSIS-NN Library

C 194 53 Updated Oct 1, 2024

AMD's Machine Intelligence Library

Assembly 1,059 221 Updated Oct 4, 2024

Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.

Jupyter Notebook 202 73 Updated Apr 22, 2019

nGraph has moved to OpenVINO

C++ 1,356 221 Updated Oct 15, 2020

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 14,244 2,870 Updated Oct 4, 2024

Open, Modular, Deep Learning Accelerator

Scala 251 72 Updated Apr 10, 2024

A polyhedral compiler for expressing fast and portable data parallel algorithms

C++ 916 132 Updated Oct 2, 2024

Open Neural Network Compiler

C++ 514 92 Updated Aug 22, 2023

Open deep learning compiler stack for Kendryte AI accelerators ✨

C# 743 181 Updated Sep 30, 2024

Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks

Verilog 579 101 Updated Jan 3, 2020

High-performance automatic differentiation of LLVM and MLIR.

LLVM 1,264 108 Updated Oct 4, 2024

Tenstorrent MLIR compiler

C++ 61 7 Updated Oct 4, 2024

SparseTIR: Sparse Tensor Compiler for Deep Learning

Python 131 13 Updated Mar 31, 2023