Project Ideas

Possible Topics

  • Identifing speaker accents
  • Chaos Game Representation to visualize sequences
  • Musical Phrase identification
  • Melody Transcription
  • Time Signature Identification
  • Audio Score Following
  • Neural Nets as Classifiers
  • Low-Frequency Realtime Pitch Tracking

Details

Musical Phrase Identification

  • Identify beginning and end of phrase
  • Definitely easiest with monophonic music
  • for polyphonic music, phrase identification could follow melody transcription
  • probably need more features than what the melody transcriber gives. Mix original audio with "cooked" output from melody transcriber

Melody Transcription

  • Naive approach: look for loudest spectral peak
  • Look for harmonically-related peaks to find a fundamental pitch
  • This seems like a relatively obvious/well researched problem. What about related issues?
    • catching presence/absence of main melodic line. what do melody-extractors do when the melody takes a break?
    • identify changes in instrumentation
  • References
    • Durey, A. S., and Clements, M. A. Features for Melody Spotting Using Hidden Markov Models. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.
    • Herrera, P., Amatriain, X., Batlle, E., and Serra. X. Towards Instrument Segmentation for Music Content Description: A Critical Review of Instrument Classification Techniques. In Proceedings of the International Symposium on Music Information Retrieval, 2000
    • http://www.music-ir.org/mirex/2005/index.php/Audio_Melody_Extraction

Audio Score Following

  • Perhaps focusing on following an inexact performance
  • performance mistakes or semi-improvised parts?
  • polyphonic audio sources seem less explored, but in a performance environment it seems that you could easily put a microphone on a single instrument to get a monophonic signal for tracking, so how necessary is it?
  • What kind of training data would I need? Not much available for a specific live performance

Neural Nets as Classifiers

  • Could be incorporated in another project
  • Look up DAn's work at Berkely
  • Are there comparisons of ANNs vs. SVMs?

Low-Frequency Realtime Pitch Tracking

  • Upright bass fundemental frequency goes down to about 40 Hz

General Notes

  • What could we do if given a transcription or aligned score?
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-ShareAlike 3.0 License