DiffVel: note-level MIDI velocity estimation for piano performance by a double conditioned diffusion model
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Kim, Hyon
- dc.contributor.author Serra, Xavier
- dc.date.accessioned 2023-08-31T15:36:32Z
- dc.date.available 2023-08-31T15:36:32Z
- dc.date.issued 2023-08-31
- dc.description This work has been accepted at the CMMR2023, the 16th International Symposium on Computer Music Multidisciplinary Research, at Tokyo, Japan. November 13-17, 2023.
- dc.description.abstract In any piano performance, expressiveness is paramount for effectively conveying the intent of the performer, and one of the most significant aspects of expressiveness is the loudness at the individual key or note level. However, accurately detecting note-level loudness poses a considerable technical challenge due to the polyphonic nature of piano performances, wherein multiple notes are played simultaneously, as well as the similarity of harmonic elements. MIDI velocity is crucial for indicating loudness in piano notes. This study conducted experiments for estimating a note-level MIDI velocity expanding the DiffRoll model: the Diffusion Model for piano performance transcription. By adopting double conditioning—audio and score information—and implementing noise removal as a post-processing, our findings highlight the model’s potential in estimating note level MIDI velocity.ca
- dc.description.sponsorship This research was carried out under the project Musical AI - PID2019- 111403GBI00/AEI/10.13039/501100011033, funded by the Spanish Ministerio de Ciencia e Innovación and the Agencia Estatal de Investigación.
- dc.format.mimetype application/pdf*
- dc.identifier.uri http://hdl.handle.net/10230/57790
- dc.language.iso engca
- dc.relation.projectID info:eu-repo/grantAgreement/ES/2PE/PID2019-111403GB-I00
- dc.rights This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).ca
- dc.rights.accessRights info:eu-repo/semantics/openAccessca
- dc.rights.uri https://creativecommons.org/licenses/by/4.0ca
- dc.subject.keyword MIDI Velocity Estimation
- dc.subject.keyword Diffusion Model
- dc.subject.keyword Conditioned Deep Neural Network
- dc.subject.keyword FiLM Conditioning
- dc.title DiffVel: note-level MIDI velocity estimation for piano performance by a double conditioned diffusion modelca
- dc.type info:eu-repo/semantics/preprintca
- dc.type.version info:eu-repo/semantics/submittedVersionca