Skip to content
View soldni's full-sized avatar
🏳️‍🌈
vibing!
🏳️‍🌈
vibing!

Organizations

@Georgetown-IR-Lab @allenai

Block or report soldni

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An autonomous novel writing pipeline, by Hermes Agent

Python 273 49 Updated Mar 20, 2026

poormanray is a collection of simple tools to manage cloud instances (EC2) and distribute jobs on them

Python 2 Updated Mar 23, 2026

Tools to build fast quality classifiers for Olmo data filtering

Python 3 Updated Mar 16, 2026

My personal website

TeX 1 Updated Mar 23, 2026

Tooling for exact and MinHash deduplication of large-scale text datasets

Rust 76 5 Updated Mar 16, 2026
Python 528 46 Updated Mar 13, 2025

utilities for batched llm calls with retries

Python 49 2 Updated Mar 17, 2026

RSS reader for macOS and iOS.

Swift 9,857 666 Updated Mar 23, 2026

Our library for RL environments + evals

Python 3,928 521 Updated Mar 24, 2026

Code for the paper "BPE stays on SCRIPT"

Python 16 3 Updated Mar 4, 2026

Official Rust Implementation of Model2Vec

Rust 164 15 Updated Mar 22, 2026

📚 Freely available programming books

Python 384,508 66,047 Updated Mar 20, 2026

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 268 17 Updated Jul 8, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

133,007 33,605 Updated Mar 9, 2026

Data mapping framework for rust stuff

Rust 49 4 Updated Mar 24, 2026

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

Python 160 22 Updated Jun 18, 2024

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

Python 13,243 819 Updated Mar 24, 2026

PyTorch building blocks for the OLMo ecosystem

Python 986 193 Updated Mar 24, 2026

GhoulBoii's Firefox Dots

CSS 6 Updated Jan 15, 2025

OLMost every training recipe you need to perform data interventions with the OLMo family of models.

Python 67 16 Updated Mar 21, 2026

Curated list of datasets and tools for post-training.

4,360 357 Updated Mar 9, 2026

Versatile typeface for code, from code.

JavaScript 21,907 660 Updated Mar 22, 2026

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 48,409 2,105 Updated Mar 24, 2026

😸 Soothing pastel theme for the high-spirited!

TypeScript 18,619 339 Updated Feb 23, 2026

A more intuitive version of du in rust

Rust 11,456 261 Updated Feb 21, 2026

A curated list of resources and examples of ASCII Art

158 11 Updated Apr 24, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 3,026 233 Updated Feb 9, 2026

Toolkit for linearizing PDFs for LLM datasets/training

Python 17,055 1,363 Updated Mar 23, 2026

LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.

C++ 832 112 Updated Dec 6, 2025

Large Language Model (LLM) module for the Spezi Ecosystem

Swift 280 43 Updated Mar 21, 2026
Next