2014:Audio Fingerprinting

From MIREX Wiki


This task requires the query by using exact but noisy recordings.



  • 10,000 songs (*.wav)
  • mono, 44.1 kHz, 16 bit resolution


  • 1,264 10-second clips
  • mono, 44.1 kHz, 16 bit resolution
  • Recorded by variety brand of smartphones, containing noise

Evaluation Procedures

Top-1 hit rate

Submission Format

Participants are required to submit a breakdown version of algorithm. The two parts are:

1. Database Builder Command format:

builder %file.db.list% %dir_db%

where %file.db.list% is the input list of database audio files named as uniq_key.wav For example:


Output file(s) should be placed into %dir_db%

2. Matcher Command format:

matcher %dir_db% %file.query.list% %resultFile%

where %dir_db% is the directory for the built database. %file.query.list% is the input list of query clips, for example:


The result file gives retrieved result for each query. The format should be:

%main_query_file_name% %main_top_1_candiate_file_name%

For example:

q0001 00204
q0002 08964
q0003 05566

Time and hardware limits

Due to the potentially high number of participants in this and other audio tasks, hard limits on the runtime of submissions are specified. The time/storage limits of different steps are shown in the following table:

Steps Time limit Storage (hard disk) limit
extractor (for database) Totally 72 hours 40KB for every 10 second audio + 250 MB extra overhead (if needed)
extractor (for query) 40 KB for each query clip
matcher N/A

Submissions that exceed these limitations may not receive a result.

Potential Participants


name / email