mariaalfaroc / late-fusion-music-transcription Public

Notifications You must be signed in to change notification settings
Fork 2
Star 2

Late-fusing OMR and A2S predictions using four different algorithms

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
confusion_networks		confusion_networks
scenarios		scenarios
word_graphs		word_graphs
5-crossval.tgz		5-crossval.tgz
LICENSE		LICENSE
README.md		README.md
config.py		config.py
data_processing.py		data_processing.py
evaluation.py		evaluation.py
experimentation.py		experimentation.py
kaldi_preprocessing.py		kaldi_preprocessing.py
main.py		main.py
models.py		models.py

Repository files navigation

Late multimodal fusion for image and audio music transcription

Code for the paper:

María Alfaro-Contreras, Jose J. Valero-Mas, Jose M. Iñesta, Jorge Calvo-Zaragoza
Late multimodal fusion for image and audio music transcription
Expert Systems with Applications, 216, 119491, 2023

Dataset used: Camera-PrIMuS. Available here. The partitions used can be found in the 5-crossval.tgz file.

Citation

@article{alfaro2023late,
  author       = {Alfaro-Contreras, Mar{\'i}a and Valero-Mas, Jose J. and I{\~n}esta, Jose M. and Calvo-Zaragoza, Jorge},
  title        = {{Late multimodal fusion for image and audio music transcription}},
  journal      = {Expert Systems with Applications},
  volume       = {216},
  pages        = {119491},
  year         = {2023},
  issn         = {0957-4174}
}