This project is a music information retrieval library and research dataset designed for audio feature extraction and music genre classification. It provides a framework for training and evaluating machine learning models that categorize audio tracks into hierarchical genre structures, supported by a collection of open-licensed MP3 tracks and pre-computed features.
The project includes a music metadata API client to fetch structured track, album, and artist information from external data sources. It utilizes these external integrations to map parent-child relationships between genres and organize music libraries.
The system covers a broad range of capabilities including audio feature analysis, music dataset management, and the implementation of baseline models for genre recognition. It also facilitates the downloading and syncing of remote audio files for local offline analysis and the indexing of music collections based on computed audio characteristics.
The project utilizes interactive computation notebooks for feature extraction and the generation of analysis results.