-
Mila, Université de Montréal
- Montreal, QC, Canada
-
08:14
(UTC -04:00) - https://hiroki11x.github.io/
- @_hiroki11x
- in/hiroki11x
Highlights
Stars
Academic Research Skills for Claude Code: research → write → review → revise → finalize
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
The simplest, fastest repository for training/finetuning small-sized VLMs.
slime is an LLM post-training framework for RL Scaling.
Fast and accurate automatic speech recognition (ASR) for edge devices
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
ImageNet-Sketch data set for evaluating model's ability in learning (out-of-domain) semantics at ImageNet scale
A collection of optimization problems in mathematics
A theory of optimal learning rate schedules in SGD from optimal control theory
Scalable Computing for Advanced Library and Environment
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training (TMLR2025)
Implementatoin for paper: A Unified Stability Analysis of SAM vs SGD: Role of Data Coherence and Emergence of Simplicity Bias
CellViT: Vision Transformers for Precise Cell Segmentation and Classification
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Open-source framework for the research and development of foundation models.
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models