Welcome to the UPF Digital Repository

DiffVel: note-level MIDI velocity estimation for piano performance by a double conditioned diffusion model

Show simple item record

dc.contributor.author Kim, Hyon
dc.contributor.author Serra, Xavier
dc.date.accessioned 2023-08-31T15:36:32Z
dc.date.available 2023-08-31T15:36:32Z
dc.date.issued 2023-08-31
dc.identifier.uri http://hdl.handle.net/10230/57790
dc.description This work has been accepted at the CMMR2023, the 16th International Symposium on Computer Music Multidisciplinary Research, at Tokyo, Japan. November 13-17, 2023.
dc.description.abstract In any piano performance, expressiveness is paramount for effectively conveying the intent of the performer, and one of the most significant aspects of expressiveness is the loudness at the individual key or note level. However, accurately detecting note-level loudness poses a considerable technical challenge due to the polyphonic nature of piano performances, wherein multiple notes are played simultaneously, as well as the similarity of harmonic elements. MIDI velocity is crucial for indicating loudness in piano notes. This study conducted experiments for estimating a note-level MIDI velocity expanding the DiffRoll model: the Diffusion Model for piano performance transcription. By adopting double conditioning—audio and score information—and implementing noise removal as a post-processing, our findings highlight the model’s potential in estimating note level MIDI velocity.
dc.description.sponsorship This research was carried out under the project Musical AI - PID2019- 111403GBI00/AEI/10.13039/501100011033, funded by the Spanish Ministerio de Ciencia e Innovación and the Agencia Estatal de Investigación.
dc.format.mimetype application/pdf
dc.language.iso eng
dc.rights This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
dc.rights.uri https://creativecommons.org/licenses/by/4.0
dc.title DiffVel: note-level MIDI velocity estimation for piano performance by a double conditioned diffusion model
dc.type info:eu-repo/semantics/preprint
dc.subject.keyword MIDI Velocity Estimation
dc.subject.keyword Diffusion Model
dc.subject.keyword Conditioned Deep Neural Network
dc.subject.keyword FiLM Conditioning
dc.relation.projectID info:eu-repo/grantAgreement/ES/2PE/PID2019-111403GB-I00
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/submittedVersion

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics

In collaboration with Compliant to Partaking