azuwis/pianotrans
A GUI application wrapping ByteDance's PyTorch piano transcription model to convert audio/video recordings into MIDI files.

This project provides a simple graphical interface for ByteDance's High-resolution Piano Transcription system, a PyTorch-based deep learning model. The model regresses note onset and offset times, which lets it transcribe piano performances together with pedal information. Users select audio or video files, and the application automatically uses the GPU for inference and writes the results as MIDI files. The project ships pre-built packages for Windows and supports Nix-based deployment.
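
The file-to-MIDI flow described above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code. The helper names `pick_device`, `midi_path_for`, and `transcribe_file` are hypothetical, the rule of appending `.mid` to the input file name is an assumed naming convention, and the fallback to the CPU when no GPU is found is an assumption as well. The calls into `piano_transcription_inference` follow that package's published usage example as best understood here, and need the package and PyTorch installed.

```python
from pathlib import Path


def pick_device() -> str:
    """Return 'cuda' when PyTorch reports a usable GPU, otherwise 'cpu'.

    The CPU fallback is an assumption made for this sketch.
    """
    try:
        import torch  # imported lazily so the helper also works without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def midi_path_for(media_path: str) -> Path:
    """Hypothetical naming rule: append '.mid' to the full input file name."""
    p = Path(media_path)
    return p.with_name(p.name + ".mid")


def transcribe_file(media_path: str) -> Path:
    """Transcribe one recording to MIDI and return the output path.

    Requires the piano_transcription_inference package; the calls below
    mirror its usage example and are not verified against pianotrans itself.
    """
    from piano_transcription_inference import (
        PianoTranscription,
        load_audio,
        sample_rate,
    )

    out = midi_path_for(media_path)
    # Decode the recording to mono audio at the sample rate the model expects.
    audio, _ = load_audio(media_path, sr=sample_rate, mono=True)
    transcriptor = PianoTranscription(device=pick_device())
    # Runs inference and writes the MIDI file to the given path.
    transcriptor.transcribe(audio, str(out))
    return out
```

Calling `transcribe_file("recital.mp4")` would, under these assumptions, write `recital.mp4.mid` next to the input. Keeping the device choice and output naming in separate helpers makes them easy to check without loading the model.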