Core HW bindings and optimizations for BigDL

C++ 37 32 Updated Nov 24, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 112 5 Updated Apr 27, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,740 1,409 Updated Jan 28, 2026

🚀🚀 "Large model": train a small 64M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 45,095 5,449 Updated Mar 31, 2026

List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.

104 6 Updated Jun 2, 2024

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,108 356 Updated Mar 26, 2026

Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 33 6 Updated Mar 18, 2026

Awesome-LLM: a curated list of Large Language Models

26,575 2,408 Updated Jul 31, 2025

Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm

Jupyter Notebook 169 41 Updated Apr 29, 2025

🔥Highlighting the top ML papers every week.

12,271 768 Updated Jul 20, 2025

NOIP, NOI, IOI

Rich Text Format 479 211 Updated Oct 27, 2024

Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community: https://discord.com/invite/TgHXuSJEk6

Jupyter Notebook 560 34 Updated Dec 4, 2023

GCN implementation on top of Apache Spark

Scala 16 3 Updated Oct 30, 2022

Common English vocabulary list

2,287 516 Updated May 11, 2024

Simplify your ONNX model

C++ 4,311 421 Updated Mar 24, 2026

📕machine learning tech collections at Microsoft and subsidiaries.

453 76 Updated Jan 30, 2023

Informatics competitions; the official website in China is:

C++ 68 24 Updated Nov 28, 2018

A demo project that shows how to use Intel Analytics Zoo to train an NCF deep learning model and run inference with it

Python 5 4 Updated Nov 22, 2022

Notebook to train an AI model to detect diseases in chest X-rays

Jupyter Notebook 8 5 Updated Apr 23, 2019

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,692 732 Updated Mar 18, 2026

A living collection of deep learning problems

HTML 1,730 594 Updated May 3, 2024

Step-by-step Deep Learning Tutorials on Apache Spark using BigDL

Jupyter Notebook 210 124 Updated Jan 3, 2023

Google hosts generator

Shell 3,434 1,234 Updated Nov 1, 2025