Skip to content
View AndSonder's full-sized avatar
🎯
Focusing
🎯
Focusing
  • University of Electronic Science and Technology of China
  • Cheng Du

Highlights

  • Pro

Organizations

@sanyuankexie @neet-cv

Block or report AndSonder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

CUDA Programming

22 repositories

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,407 196 Updated Apr 29, 2021

Sample codes for my CUDA programming book

Cuda 1,649 337 Updated Feb 15, 2025

Hands-On GPU Accelerated Computer Vision with OpenCV and CUDA, published by Packt

C++ 639 230 Updated Jan 30, 2023

Learn CUDA Programming, published by Packt

Cuda 1,111 249 Updated Dec 30, 2023

Serving Inside Pytorch

C++ 155 13 Updated Feb 27, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 96 26 Updated Apr 25, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 6,922 1,133 Updated Feb 28, 2025

An Open Convolutional Neural Network Framework in C++ From Scratch

C++ 60 8 Updated Mar 13, 2021
Cuda 109 29 Updated Apr 11, 2024
Cuda 5 Updated Feb 25, 2025

CUDA project for uni subject

Jupyter Notebook 23 2 Updated Oct 26, 2020

CUDA Gemm Convolution implementation

C++ 6 1 Updated Feb 4, 2022

Fast CUDA matrix multiplication from scratch

Cuda 649 86 Updated Dec 28, 2023

Step-by-step optimization of CUDA SGEMM

Cuda 288 44 Updated Mar 30, 2022
Cuda 5 1 Updated Oct 18, 2024

compiler learning resources collect.

Python 2,289 343 Updated May 27, 2024

Hands-On Practical MLIR Tutorial

C++ 408 59 Updated Oct 20, 2023

A self-learning tutorail for CUDA High Performance Programing.

JavaScript 398 43 Updated Dec 17, 2024

CUDA SGEMM optimization note

Cuda 13 1 Updated Oct 31, 2023

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++ 297 73 Updated Jan 13, 2025