
TENT: Technique-Embedded Note Tracking for Real-World Guitar Solo Recordings
By: Ting-Wei Su, Yuan-Ping Chen, Li Su and Yi-Hsuan Yang
References
- Abeßer, J., Lukashevich, H., & Schuller, G. (2010). Feature-based extraction of plucking and expression styles of the electric bass guitar. In Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 2290–2293. DOI: 10.1109/ICASSP.2010.5495945
- Benetos, E., Dixon, S., Giannoulis, D., Kirchhoff, H., & Klapuri, A. (2013). Automatic music transcription: challenges and future directions. J. Intelligent Information Systems, 41(3), 407–434. DOI: 10.1007/s10844-013-0258-3
- Bittner, R. M., Salamon, J., Essid, S., & Bello, J. P. (2015). Melody extraction by contour classification. In Proc. International Society for Music Information Retrieval Conference, pages 500–506.
- Bogdanov, D., Wack, N., Gómez, E., Gulati, S., Herrera, P., Mayor, O., Roma, G., Salamon, J., Zapata, J., & Serra, X. (2013). Essentia: An audio analysis library for music information retrieval. In Proc. International Society for Music Information Retrieval Conference, pages 493–498. [Online]
http://essentia.upf.edu . DOI: 10.1145/2502081.2502229 - Chang, S., & Lee, K. (2014). A pairwise approach to simultaneous onset/offset detection for singing voice using correntropy. In Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 629–633. DOI: 10.1109/ICASSP.2014.6853672
- Chen, S.-H., Lee, Y.-S., Hsieh, M.-C., & Wang, J.-C. (2018). Playing technique classification based on deep collaborative learning of variational autoencoder and Gaussian process. In Proc. IEEE International Conference on Multimedia and Expo. DOI: 10.1109/ICME.2018.8486467
- Chen, S.-H., Wu, S.-H., Lee, Y.-S., Lo, R., & Wang, J.-C. (2017). Hierarchical representation based on Bayesian nonparametric tree-structured mixture model for playing technique classification. In Proc. ACM Multimedia Thematic Workshops, pages 537–543. DOI: 10.1145/3126686.3126757
- Chen, Y.-P., Su, L., & Yang, Y.-H. (2015). Electric guitar playing technique detection in real-world recordings based on F0 sequence pattern recognition. In Proc. International Society for Music Information Retrieval Conference, pages 708–714.
- Cheng, T., Dixon, S., & Mauch, M. (2015). Improving piano note tracking by HMM smoothing. In Proc. European Signal Processing Conference, pages 2054–2058. DOI: 10.1109/EUSIPCO.2015.7362736
- Chou, S.-Y., Jang, J.-S., & Yang, Y.-H. (2018). Learning to recognize transient sound events using attentional supervision. In Proc. International Joint Conference on Artificial Intelligence, pages 3336–3342. DOI: 10.24963/ijcai.2018/463
- Dattorro, J. (1997). Effect design, part 2: Delay line modulation and chorus. J. Audio Engineering Society, 45(10), 764–788.
- de Cheveigné, A., & Kawahara, H. (2002). YIN: A fundamental frequency estimator for speech and music. J. Acoustical Society of America, 111(4), 1917–1930. DOI: 10.1121/1.1458024
- de Haas, W. B., Magalhães, J. P., & Wiering, F. (2012). Improving audio chord transcription by exploiting harmonic and metric knowledge. In Proc. International Society for Music Information Retrieval Conference, pages 295–300.
- Dzhambazov, G., Holzapfel, A., Srinivasamurthy, A., & Serra, X. (2017). Metrical-accent aware vocal onset detection in polyphonic audio. arXiv preprint arXiv:1707.06163.
- Fohl, W., & Meisel, A. (2012). A feature relevance study for guitar tone classification. In Proc. International Society for Music Information Retrieval Conference, pages 211–216.
- Gill, D., & Nolan, N. (1997). Rock Lead Basics: Master Class Series. Musicians Institute Press.
- Kehling, C., Abeßer, J., Dittmar, C., & Schuller, G. (2014). Automatic tablature transcription of electric guitar recordings by estimation of score- and instrument-related parameters. In Proc. International Conference on Digital Audio Effects, pages 219–226.
- Kim, J. W., Salamon, J., Li, P., & Bello, J. P. (2018). CREPE: A convolutional representation for pitch estimation. In Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 161–165. DOI: 10.1109/ICASSP.2018.8461329
- LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proc. IEEE, 86(11), 2278–2324. DOI: 10.1109/5.726791
- Liu, J.-Y., & Yang, Y.-H. (2016). Event localization in music auto-tagging. In Proc. ACM International Conference on Multimedia, pages 1048–1057. DOI: 10.1145/2964284.2964292
- Mauch, M., Cannam, C., Bittner, R., Fazekas, G., Salamon, J., Dai, J., Bello, J., & Dixon, S. (2015). Computer-aided melody note transcription using the Tony software: Accuracy and efficiency. In Proc. International Conference on Technologies for Music Notation and Representation, pages 23–30.
- Mauch, M., & Dixon, S. (2014). pYIN: A fundamental frequency estimator using probabilistic threshold distributions. In Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 659–663. DOI: 10.1109/ICASSP.2014.6853678
- McFee, B., Raffel, C., Liang, D., Ellis, D. P., McVicar, M., Battenberg, E., & Nieto, O. (2015). LibROSA: Audio and music signal analysis in Python. In Proc. Python in Science Conference, pages 18–25. DOI: 10.25080/Majora-7b98e3ed-003
- Nam, J., Choi, K., Lee, J., Chou, S.-Y., & Yang, Y.-H. (2019). Deep learning for audio-based music classification and tagging: Teaching computers to distinguish Rock from Bach. IEEE Signal Processing Magazine, 36(1), 41–51. DOI: 10.1109/MSP.2018.2874383
- Nishikimi, R., Nakamura, E., Itoyama, K., & Yoshii, K. (2016). Musical note estimation for F0 trajectories of singing voices based on a Bayesian semi-beat-synchronous HMM. In Proc. International Society for Music Information Retrieval Conference, pages 461–467.
- Peeters, G. (2006). Music pitch representation by periodicity measures based on combined temporal and spectral representations. In Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 53–56. DOI: 10.1109/ICASSP.2006.1661210
- Raffel, C., McFee, B., Humphrey, E. J., Salamon, J., Nieto, O., Liang, D., & Ellis, D. P. (2014). mir_eval: A transparent implementation of common MIR metrics. In Proc. International Society for Music Information Retrieval Conference, pages 367–372.
- Reboursière, L., Lähdeoja, O., Drugman, T., Dupont, S., Picard-Limpens, C., & Riche, N. (2012). Left and right-hand guitar playing techniques detection. In Proc. International Conference on New Interfaces for Musical Expression, pages 7–10.
- Salamon, J., & Gómez, E. (2012). Melody extraction from polyphonic music signals using pitch contour characteristics. IEEE Trans. Audio, Speech & Language Processing, 20(6), 1759–1770. DOI: 10.1109/TASL.2012.2188515
- Salamon, J., Gómez, E., Ellis, D. P., & Richard, G. (2014). Melody extraction from polyphonic music signals: Approaches, applications, and challenges. IEEE Signal Processing Magazine, 31(2), 118–134. DOI: 10.1109/MSP.2013.2271648
- Salamon, J., Peeters, G., & Röbel, A. (2012). Statistical characterisation of melodic pitch contours and its application for melody extraction. In Proc. International Society for Music Information Retrieval Conference, pages 187–192.
- Stein, M. (2010). Automatic detection of multiple, cascaded audio effects in guitar recordings. In Proc. International Conference on Digital Audio Effects, pages 4–7.
- Su, L., Yu, L.-F., & Yang, Y. H. (2014). Sparse cepstral and phase codes for guitar playing technique classification. In Proc. International Society for Music Information Retrieval Conference, pages 9–14.
- Xi, Q., Bittner, R. M., Pauwels, J., Ye, X., & Bello, J. P. (2018). GuitarSet: A dataset for guitar transcription. In Proc. International Society for Music Information Retrieval Conference, pages 453–460.
- Yang, L., Maezawa, A., Smith, J. B. L., & Chew, E. (2017). Probabilistic transcription of sung melody using a pitch dynamic model. In Proc. IEEE International Conference on Acoustics, Speech, & Signal Processing, pages 301–305. DOI: 10.1109/ICASSP.2017.7952166
DOI: https://doi.org/10.5334/tismir.23 | Journal eISSN: 2514-3298
Language: English
Submitted on: Oct 2, 2018
Accepted on: Feb 27, 2019
Published on: Jul 9, 2019
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
Keywords:
© 2019 Ting-Wei Su, Yuan-Ping Chen, Li Su, Yi-Hsuan Yang, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.