Skip to content
View ccchengff's full-sized avatar

Highlights

  • Pro

Organizations

@DMALab @PKU-DAIR

Block or report ccchengff

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 35 1 Updated Oct 16, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 5 Updated Jun 29, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 3,157 240 Updated Mar 28, 2026

A comprehensive guide for beginners in the field of data management and artificial intelligence.

601 24 Updated Apr 8, 2025

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

315 18 Updated Jun 21, 2024

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,790 123 Updated Aug 20, 2024

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you have any interests, please visit/star/fork https://github.com/P…

Python 23 19 Updated Oct 22, 2025

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

Python 179 16 Updated Jan 19, 2026

A curated reading list of research in Mixture-of-Experts(MoE).

662 46 Updated Oct 30, 2024

[IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any interests, please visit/star/fork https://github.com/Youhe-Jiang…

Python 52 5 Updated May 31, 2023

A scalable graph learning toolkit for extremely large graph datasets. (WWW'22, 🏆 Best Student Paper Award)

Python 157 24 Updated May 10, 2024

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 335 42 Updated Dec 13, 2025

[CVPR 2022] PointCLIP: Point Cloud Understanding by CLIP

Python 409 37 Updated Nov 24, 2022

An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.

Python 64 10 Updated Nov 11, 2025

A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, please visit/star/fork https://github.com/PKU-DAIR/Hetu

Python 124 59 Updated Dec 18, 2023

Towards Generalized and Efficient Blackbox Optimization System/Package (KDD 2021 & JMLR 2024)

Python 433 57 Updated Mar 28, 2026

Generalized and Efficient Blackbox Optimization System.

Python 85 84 Updated Feb 21, 2023

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

275,790 20,922 Updated Aug 22, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,694 27,366 Updated Mar 31, 2026

The Julia Programming Language

Julia 48,531 5,755 Updated Mar 31, 2026

Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)

Python 182 48 Updated Nov 19, 2018

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

Python 2,714 623 Updated Nov 22, 2022

😎 Awesome lists about all kinds of interesting topics

450,461 33,843 Updated Mar 9, 2026

A Detailed Cplusplus Concurrency Tutorial 《C++ 并发编程指南》

C++ 5,487 1,484 Updated Feb 27, 2026

LIBSVM -- A Library for Support Vector Machines

Java 4,697 1,639 Updated Dec 29, 2025

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 18,208 3,991 Updated Mar 31, 2026

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 28,199 8,858 Updated Mar 31, 2026

A Flexible and Powerful Parameter Server for large-scale machine learning

Java 6,786 1,591 Updated Oct 13, 2025

The simulator of RISC-V, implemented by C++

Assembly 3 Updated Sep 25, 2017