- Google
- Greater New York City Area
- linkedin.com/in/yueqi-wang/
- https://yueqiw.github.io
Highlights
- Pro
Stars
Sparse Transition Matrix-Accelerated Trie Index for Constrained Decoding (https://arxiv.org/abs/2602.22647)
An Open Foundation Model and Benchmark to Accelerate Generative Recommendation
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
An open-source AI agent that brings the power of Gemini directly into your terminal.
Implementing DeepSeek R1's GRPO algorithm from scratch
Paper List of Inference/Test Time Scaling/Computing
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Minimal reproduction of DeepSeek R1-Zero
SGLang is a high-performance serving framework for large language models and multimodal models.
Implementation of SVRG for training neural networks
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
A collection of classic and cutting-edge industry papers in the field of recommendation/advertising/search.
Processing and analysis of patch clamp electrophysiology data
This repository collects debiasing methods for recommendation
Source code for the X Recommendation Algorithm
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Making large AI models cheaper, faster and more accessible
A personal knowledge management and sharing system for VSCode
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM