
Draw and Listen! A Sketch-Based System for Music Inpainting
By: Christodoulos Benetatos and Zhiyao Duan
References
- Adler, A., Emiya, V., Jafari, M. G., Elad, M., Gribonval, R., and Plumbley, M. D. (2012). Audio inpainting. IEEE Transactions on Audio, Speech and Language Processing, 20(3):922–932. DOI: 10.1109/TASL.2011.2168211
- Arpege-Music (2013). Pizzicato notation software. http://www.arpegemusic.com/manual36/EN855.htm. Online; accessed 9 December 2021.
- Benetatos, C., VanderStel, J., and Duan, Z. (2020). BachDuet: A deep learning system for human–machine counterpoint improvisation. In Proceedings of the International Conference on New Interfaces for Musical Expression, pages 635–640.
- Berg, T., Chattopadhyay, D., Schedel, M., and Vallier, T. (2012). Interactive music: Human motion initiated music generation using skeletal tracking by Kinect. In Proceedings of the Conference of the Society for Electro-Acoustic Music in the United States.
- Chen, K., Wang, C.-i., Berg-Kirkpatrick, T., and Dubnov, S. (2020). Music SketchNet: Controllable music generation via factorized representations of pitch and rhythm. In Proceedings of the 21st International Society for Music Information Retrieval Conference, pages 77–84. ISMIR.
- Coduys, T. and Ferry, G. (2004). IanniX: Aesthetical/symbolic visualisations for hypermedia composition. In Journées d'Informatique Musicale.
- Cuthbert, M. S. and Ariza, C. (2010). Music21: A toolkit for computer-aided musicology and symbolic music data. In Downie, J. S. and Veltkamp, R. C., editors, Proceedings of the International Society for Music Information Retrieval Conference, pages 637–642.
- Dannenberg, R. B. and Raphael, C. (2006). Music score alignment and computer accompaniment. Communications of the ACM, 49(8):38–43. DOI: 10.1145/1145287.1145311
- Donahue, C., Simon, I., and Dieleman, S. (2019). Piano Genie. In Proceedings of the 24th International Conference on Intelligent User Interfaces, pages 160–164, New York, NY, USA. Association for Computing Machinery. DOI: 10.1145/3301275.3302288
- Dowling, W. J., Barbey, A., and Adams, L. (1999). Melodic and rhythmic contour in perception and memory. In Yi, S., editor, Music, Mind, and Science, pages 166–188. Seoul National University Press.
- Farbood, M. M., Pasztor, E., and Jennings, K. (2004). Hyperscore: A graphical sketchpad for novice composers. IEEE Computer Graphics and Applications, 24(1):50–54. DOI: 10.1109/MCG.2004.1255809
- Greshler, G., Shaham, T. R., and Michaeli, T. (2021). Catch-A-Waveform: Learning to generate audio from a single short example. arXiv preprint arXiv:2106.06426.
- Hadjeres, G. and Nielsen, F. (2020). Anticipation-RNN: Enforcing unary constraints in sequence generation, with application to interactive music generation. Neural Computing and Applications, 32(4):995–1005. DOI: 10.1007/s00521-018-3868-4
- Higgins, I., Matthey, L., Pal, A., Burgess, C., Glorot, X., Botvinick, M., Mohamed, S., and Lerchner, A. (2016). beta-VAE: Learning basic visual concepts with a constrained variational framework. In 5th International Conference on Learning Representations.
- Huang, A., Hawthorne, C., Roberts, A., Dinculescu, M., Wexler, J., Hong, L., and Howcroft, J. (2019). Bach Doodle: Approachable music composition with machine learning at scale. In Proceedings of the 20th International Society for Music Information Retrieval Conference (ISMIR).
- Kingma, D. and Ba, J. (2015). Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR).
- Kingma, D. P. and Welling, M. (2014). Auto-encoding variational Bayes. In Proceedings of the 2nd International Conference on Learning Representations.
- Kitahara, T., Giraldo, S., and Ramirez, R. (2018). JamSketch: Improvisation support system with GA-based melody creation from user's drawing. In Aramaki, M., Davies, M. E. P., Kronland-Martinet, R., and Ystad, S., editors, Music Technology with Swing, pages 509–521. Springer International Publishing. DOI: 10.1007/978-3-030-01692-0_34
- Krumhansl, C. L. and Kessler, E. J. (1982). Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. Psychological Review, 89(4):334. DOI: 10.1037/0033-295X.89.4.334
- Lewis, G. E. (2000). Too many notes: Computers, complexity and culture in Voyager. Leonardo Music Journal, pages 33–39. DOI: 10.1162/096112100570585
- Mao, H. H., Shin, T., and Cottrell, G. (2018). DeepJ: Style-specific music generation. In 2018 IEEE 12th International Conference on Semantic Computing (ICSC), pages 377–382. IEEE. DOI: 10.1109/ICSC.2018.00077
- Marafioti, A., Majdak, P., Holighaus, N., and Perraudin, N. (2020). GACELA: A generative adversarial context encoder for long audio inpainting of music. IEEE Journal of Selected Topics in Signal Processing, 15(1):120–131. DOI: 10.1109/JSTSP.2020.3037506
- Pati, A., Lerch, A., and Hadjeres, G. (2019). Learning to traverse latent spaces for musical score inpainting. In Proceedings of the 20th International Society for Music Information Retrieval Conference, pages 343–351. ISMIR.
- Sturm, B. L., Santos, J. F., Ben-Tal, O., and Korshunova, I. (2016). Music transcription modelling and composition using deep learning. In Conference on Computer Simulation of Musical Creativity.
- Thiebaut, J.-B., Healey, P. G., and Bryan-Kinns, N. (2008). Drawing electroacoustic music. In International Computer Music Conference.
- U&I-Software (1997). MetaSynth + Xx. https://uisoftware.com/metasynth/. Online; accessed 9 December 2021.
- Wuerkaixi, A., Benetatos, C., Duan, Z., and Zhang, C. (2021). CollageNet: Fusing arbitrary melody and accompaniment into a coherent song. In Proceedings of the 22nd International Society for Music Information Retrieval Conference.
- Xenakis, I. (1977). UPIC. https://en.wikipedia.org/wiki/UPIC. Online; accessed 9 December 2021.
- Yang, R., Wang, D., Wang, Z., Chen, T., Jiang, J., and Xia, G. (2019). Deep music analogy via latent representation disentanglement. In Proceedings of the 20th International Society for Music Information Retrieval Conference, pages 596–603. ISMIR.
- Yasuhara, A., Fujii, J., and Kitahara, T. (2019). Extending JamSketch: An improvisation support system. In 16th Sound and Music Computing Conference, pages 289–290.
DOI: https://doi.org/10.5334/tismir.128 | Journal eISSN: 2514-3298
Language: English
Submitted on: Dec 22, 2022
Accepted on: Aug 4, 2022
Published on: Nov 2, 2022
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
© 2022 Christodoulos Benetatos, Zhiyao Duan, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.