Voxaboxen
Acoustic classification
Call Identification
Package or Library
Voxaboxen is a deep learning framework designed to find the start and stop times of (possibly overlapping) sound events in a recording.
Model metadata | Value |
---|---|
Tool Type | Package or Library |
Broad Task | Acoustic classification |
Specific Task | Call Identification |
Model type | AVES, Transformer |
Description | Voxaboxen is a deep learning framework designed to find the start and stop times of (possibly overlapping) sound events in a recording. |
Task Specific | Yes |
Ecology Specific | Yes |
Language(s) | Python |
Last Edited Time | 6/21/24 10:24 |
Related Publication(s) | https://doi.org/10.5281/zenodo.8381019 |
Dependencies | PyYAML, einops, intervaltree, librosa, matplotlib, mir_eval, numpy, pandas, plumbum, pytorch, scipy, seaborn, soundfile, torchaudio, tqdm |
Tool URL (Github etc.) | https://github.com/earthspecies/voxaboxen |
Last Update (time ago) | Last updated within 6 months |
License | AGPL-3.0 |
Contact Name | Benjamin Hoffman |
Contact Email | mailto:benjamin@earthspecies.org |
Contact Responsiveness | Very responsive |
HuggingFace URL | nan |
Reproducibility Method | nan |