DiffVel: note-level MIDI velocity estimation for piano performance by a double conditioned diffusion model

dc.contributor.authorKim, Hyon
dc.contributor.authorSerra, Xavier
dc.date.accessioned2023-08-31T15:36:32Z
dc.date.available2023-08-31T15:36:32Z
dc.date.issued2023-08-31
dc.descriptionThis work has been accepted at the CMMR2023, the 16th International Symposium on Computer Music Multidisciplinary Research, at Tokyo, Japan. November 13-17, 2023.
dc.description.abstractIn any piano performance, expressiveness is paramount for effectively conveying the intent of the performer, and one of the most significant aspects of expressiveness is the loudness at the individual key or note level. However, accurately detecting note-level loudness poses a considerable technical challenge due to the polyphonic nature of piano performances, wherein multiple notes are played simultaneously, as well as the similarity of harmonic elements. MIDI velocity is crucial for indicating loudness in piano notes. This study conducted experiments for estimating a note-level MIDI velocity expanding the DiffRoll model: the Diffusion Model for piano performance transcription. By adopting double conditioning—audio and score information—and implementing noise removal as a post-processing, our findings highlight the model’s potential in estimating note level MIDI velocity.ca
dc.description.sponsorshipThis research was carried out under the project Musical AI - PID2019- 111403GBI00/AEI/10.13039/501100011033, funded by the Spanish Ministerio de Ciencia e Innovación and the Agencia Estatal de Investigación.
dc.format.mimetypeapplication/pdf*
dc.identifier.urihttp://hdl.handle.net/10230/57790
dc.language.isoengca
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/2PE/PID2019-111403GB-I00
dc.rightsThis work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).ca
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttps://creativecommons.org/licenses/by/4.0ca
dc.subject.keywordMIDI Velocity Estimation
dc.subject.keywordDiffusion Model
dc.subject.keywordConditioned Deep Neural Network
dc.subject.keywordFiLM Conditioning
dc.titleDiffVel: note-level MIDI velocity estimation for piano performance by a double conditioned diffusion modelca
dc.typeinfo:eu-repo/semantics/preprintca
dc.type.versioninfo:eu-repo/semantics/submittedVersionca

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kim_cmm_diff.pdf
Size:
581.35 KB
Format:
Adobe Portable Document Format
Description:

License

Rights