Heya PDF Converter

A powerful and versatile tool for converting documents to PDF and Word formats. Heya supports converting HTML, Markdown, WeChat articles to PDF, and PDF to Word documents.

Supports three ways to use: Command Line Interface, Web Interface, and Desktop Application.

Features

Multi-format Conversion: Convert HTML, Markdown, WeChat articles to PDF
PDF to Word: Convert PDF documents to editable Word (.docx) files
Batch Processing: Convert multiple files at once
PDF Merging: Merge multiple PDF files into one
Quality Control: Optional compression with configurable quality levels
Three Interfaces:
- CLI Tool: Perfect for automation and scripts
- Web Interface: User-friendly Gradio UI
- Desktop Application: Native PySide6 application
Multi-language Support: Chinese, English, Korean

Installation

Basic Installation

pip install heya

Install Web Interface

pip install heya[web]

Install Desktop Application

pip install heya[app]

Install All Optional Dependencies

pip install heya[web,app]

Install Playwright Browsers (Required for HTML/WeChat conversion)

playwright install chromium

Quick Start

Command Line Interface

# HTML to PDF
heya html2pdf -i https://example.com -o output.pdf

# Markdown to PDF
heya md2pdf -i README.md -o output.pdf

# WeChat Articles to PDF
heya wechat2pdf -i "https://mp.weixin.qq.com/s/xxx" -o output_dir/

# PDF to Word
heya pdf2word -i document.pdf -o output.docx

Web Interface

heya web

The web server starts on http://127.0.0.1:7860 by default.

Desktop Application

heya app

Usage

Command Line Options

HTML to PDF

heya html2pdf -i <url or file path> -o <output path> [options]

Options:

-i, --input: Source URL or HTML file path (supports multiple inputs)
-o, --output: Output directory path
-t, --timeout: Timeout in seconds (default: 30.0)
-q, --quality: Compression quality: 0=high, 1=medium, 2=low (default: 0)
-m, --merge: Merge all PDFs into one file

Examples:

# Single file
heya html2pdf -i https://example.com -o output.pdf

# Batch conversion
heya html2pdf -i page1.html -i page2.html -i page3.html -o pdfs/

# Merge output
heya html2pdf -i page1.html -i page2.html -o merged.pdf --merge

Markdown to PDF

heya md2pdf -i <file path> -o <output path> [options]

Options:

-i, --input: Markdown file path (supports multiple inputs)
-o, --output: Output directory path
-t, --timeout: Timeout in seconds
-q, --quality: Compression quality
-m, --merge: Merge all PDFs into one file

Examples:

heya md2pdf -i README.md -o output.pdf
heya md2pdf -i *.md -o pdfs/ --merge

WeChat Articles to PDF

heya wechat2pdf -i <wechat link> -o <output directory> [options]

Options:

-i, --input: WeChat article URL
-o, --output: Output directory path
-t, --timeout: Timeout in seconds
-q, --quality: Compression quality
-m, --merge: Merge all PDFs into one file

Supports:

Single article URL: https://mp.weixin.qq.com/s/xxx
Article list URL: https://mp.weixin.qq.com/mp/profile_ext?action=home&__biz=xxx

Examples:

# Convert single article
heya wechat2pdf -i "https://mp.weixin.qq.com/s/xxx" -o articles/

# Convert article list (batch)
heya wechat2pdf -i "https://mp.weixin.qq.com/mp/profile_ext?action=home&__biz=xxx" -o articles/

PDF to Word

heya pdf2word -i <pdf file> -o <output path>

Options:

-i, --input: Source PDF file path
-o, --output: Output Word file path

Example:

heya pdf2word -i document.pdf -o output.docx

Web Interface Features

Multi-tab Interface: Separate tabs for each conversion type
Drag & Drop: Upload files by dragging and dropping
Batch Processing: Convert multiple files at once
PDF Merging: Merge multiple PDFs into a single file
Quality Settings: Configure compression quality and timeout
Multi-language Support: English, Chinese, and Korean
Error Handling: Clear error messages with issue reporting

Desktop Application Features

Native Interface: Built with PySide6 for native desktop experience
Offline Use: Core features work without internet
Multi-language Support: Chinese, English, Korean supported
Real-time Progress: Live conversion progress display

Project Structure

heya/
├── heya/                    # Main package
│   ├── app/                 # Desktop application
│   │   ├── components/     # UI components
│   │   ├── core           # Core components
│   │   ├── handlers       # Event handlers
│   │   ├── i18n           # Internationalization
│   │   ├── services       # Service layer
│   │   └── utils          # Utility functions
│   ├── cmd/                # CLI commands
│   │   └── commands        # Command implementations
│   ├── core/               # Core conversion logic
│   │   ├── browser        # Browser management
│   │   ├── cache          # Caching
│   │   ├── config         # Configuration
│   │   ├── converters     # Converters
│   │   ├── exceptions     # Exceptions
│   │   ├── helpers        # Helper functions
│   │   ├── interfaces     # Interfaces
│   │   ├── logging        # Logging
│   │   ├── markdown       # Markdown processing
│   │   ├── models         # Data models
│   │   ├── pdf            # PDF operations
│   │   ├── performance    # Performance optimization
│   │   ├── stream_converters  # Stream converters
│   │   ├── temp           # Temporary files
│   │   ├── template        # Templates
│   │   └── wechat         # WeChat processing
│   └── web/               # Web application
│       ├── components      # UI components
│       ├── config         # Configuration
│       ├── core           # Core
│       ├── handlers       # Event handlers
│       ├── i18n           # Internationalization
│       ├── services       # Service layer
│       └── utils          # Utility functions
├── pyproject.toml          # Project configuration
└── README.md              # This file

Development

Setup Development Environment

# Clone the project
git clone https://github.com/zkep/heya.git
cd heya

# Sync dependencies with uv
uv sync

# Activate virtual environment
source .venv/bin/activate

Run Tests

uv run pytest

Type Checking

uv run mypy heya

Linting

uv run ruff check heya
uv run ruff format heya

Dependencies

click: Command-line interface framework
playwright: Browser automation for HTML/WeChat conversion
markdown: Markdown parsing
pypdf: PDF manipulation
reportlab: PDF generation and merging
gradio: Web UI (optional)
PySide6: Desktop application (optional)

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

If you encounter any issues or have questions, please open an issue on GitHub.

Acknowledgments

Built with Playwright for reliable browser automation
Web UI powered by Gradio
Desktop application powered by PySide6

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/workflows		.github/workflows
heya		heya
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README-zh.md		README-zh.md
README.md		README.md
docker-compose.yml		docker-compose.yml
heya.spec		heya.spec
pyinstaller.toml		pyinstaller.toml
pyproject.toml		pyproject.toml
uv.lock		uv.lock
uv.toml		uv.toml

Folders and files

Latest commit

History

Repository files navigation

Heya PDF Converter

Features

Installation

Basic Installation

Install Web Interface

Install Desktop Application

Install All Optional Dependencies

Install Playwright Browsers (Required for HTML/WeChat conversion)

Quick Start

Command Line Interface

Web Interface

Desktop Application

Usage

Command Line Options

HTML to PDF

Markdown to PDF

WeChat Articles to PDF

PDF to Word

Web Interface Features

Desktop Application Features

Project Structure

Development

Setup Development Environment

Run Tests

Type Checking

Linting

Dependencies

License

Contributing

Support

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages