Establishing Music Information Retrieval (MIR) and Music Digital Library (MDL) Evaluation Frameworks: 
Preliminary Foundations and Infrastructures


Greetings Colleagues:

This background document is presented to you with the hope that it will contextualize for you the strategies and long-term goals of the ongoing effort to establish robust and meaningful evaluation frameworks for MIR and MDL research. Once you have familiarized yourself with its contents, please jump to

http://www.music-ir.org/evaluation/JCDL_Workshop_Info.html

where you find more specific information about "White paper" submissions and/or participation in the Workshop on the Creation of Standardized Test Collections, Tasks, and Metrics for Music Information Retrieval (MIR) and Music Digital Library (MDL) Evaluation. Joint ACM/IEEE Conference on Digital Libraries, Portland, OR, July 14-18, 2002.

If you have any comments, suggestions or questions please contact me, J. Stephen Downie, at jdownie@uiuc.edu.

Problem Statement:

There is a very strong feeling among the MIR and MDL research communities that we must acquire standardized collections of music (in multiple representations and formats), standardized retrieval tasks (i.e., query sets, etc.), and standardized retrieval and evaluation metrics (please see http://www.music-ir.org/mirbib2/resolution). At the heart of all of this is the very real concern that current MIR/MDL research falls short of being objective and scientific insofar as its work is represented in the literature. One model being put forward is the formation of a TREC-like (see http://trec.nist.gov) set of collections and evaluation criteria. These collections and criteria will have to be designed specifically to address the wide variety of music-specific problems facing the MIR/MDL research communities. Establishing the ideal framework for developing these collections and criteria is the open question to be addressed over the period of this project.

Project Goals:

To establish the infrastructural foundation for the formation of meaningful and comprehensive MIR/MDL evaluation through the identification and/or creation of standardized test collections, retrieval tasks and performance metrics.

Principal Project Components:

There are three principal components to this project
  1. Workshop on the Creation of Standardized Test Collections, Tasks, and Metrics for Music Information Retrieval (MIR) and Music Digital Library (MDL) Evaluation. Joint ACM/IEEE Conference on Digital Libraries, Portland, OR, July 14-18, 2002. [Hereafter, the "Workshop"] Specifics found at: http://www.ohsu.edu/jcdl/ws.html#W4.
  2. Music Information Retrieval Evaluation Frameworks. (ISMIR 2002, 3rd International Conference on Music Information Retrieval, IRCAM – Centre Pompidou, Paris, France, October 13-17, 2002.) [Hereafter, the "Panel"] Specifics found at: http://ismir2002.ircam.fr/panels.html#panel1.
  3. Information Dissemination and Intelligence Gathering (Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, June 2002 to January 2003 [tentative])

Project Plans:

JCDL Workshop: Communication will be grounded in the creation of "White papers" or "Position papers". These documents will be scholarly works outlining the thoughts, opinions, and arguments of their authors with regard to their visions of what needs to be included/considered in the creation of standardized MIR/MDL testbed collections, evaluation tasks, and evaluation metrics. I expect (and want) these documents to be highly idiosyncratic: a musicologist will write with respect to the needs and wants of musicologists, a lawyer with respect to legal issues, an audio engineer with regard to audio issues, and so on. It is my goal to attract such a wide range of submissions that the multi-disciplinary approach to MIR /MDL research is truly represented.

Dr. Ellen Vorhees of the National Institute of Standards and Technology has kindly consented to be our keynote speaker on the subject of TREC-like evaluation scenarios.

Goals of the JCDL Workshop include:

  1. The creation of a reference document. That is, the collection of our submitted papers would form the starting-point literature from which future arguments and work in the area of MIR/MDL evaluation could proceed.
  2. A consensus document or "manifesto" outlining the basic framework for further development of collections and evaluation standards would be the "formal" outcome of the workshop itself.
  3. Both documents mentioned above would be made available for all to use as foundational material for the solicitation of research funding, for the establishment of collection creation agreements, and for the creation of ongoing evaluation meetings and "contests."
Panel: The ISMIR 2002 Panel will build upon and extend the findings of the JCDL Workshop. I hope to have this panel session lead to the development of prototype collections, tasks, and metrics that could be unveiled at ISMIR 2003. To this end, participants will include a) those who will be reporting on, and extending, the findings of the JCDL Workshop; and, b) those who were unable to contribute to the JCDL Workshop so new viewpoints and issues may be considered. Submissions, notes and revisions of earlier "White papers" will be incorporated into the preceding JCDL collection, perhaps as Volume II or as a "second edition."

Information Dissemination and Intelligence Gathering: This overarching component will be the "glue" that cements the efforts and results of the other two components together to form the solid foundation for MIR/MDL evaluation.

The dissemination sub-component includes the solicitation, formatting and editing of the "White papers" (from both the JCDL Workshop and the ISMIR Panel) for both print and WWW-based distribution (to be centrally housed at http://www.music-ir.org). It also includes incorporating the results of the discussions undertaken at the meetings and the results of whatever intelligence has been gathered into the main body of documentation.

The intelligence sub-component includes the preliminary identification and analyses of possible extant collections and evaluation issues that might meet or hinder the needs of MIR/MDL evaluation. Issues to be addressed include:

  1. Format availability (i.e., audio, notation, etc.)
  2. Coverage (i.e., genre and/or cultural coverage of the possible collection(s))
  3. Identification of stakeholders (i.e., owners of possible collections, etc.)
  4. Legal issues (i.e., copyright concerns, ownership and dissemination issues, possible licensing scenarios, etc.)
  5. Research issues (i.e., human-subjects concerns, ethics, etc.)
  6. Infrastructural issues (i.e., where to house collections, rights management issues, access issues, etc.)

music-ir.org is hosted by the ISRL (Information Science Research Laboratories) which is part of GSLIS (the Graduate School of Library and Information Science) at UIUC (the University of Illinois at Urbana-Champaign).

J. Stephen Downie
15 May 2002