Difference between revisions of "2006:Evalutron6000 Walkthrough For Audio Music Similarity and Retrieval"

From MIREX Wiki
 
Line 1: Line 1:
==UPDATE 23 August 2006==
+
==Welcome to Evalutron 6000==
As of 23 August there is good chance the some of the finer details of the Evalutron 6000 will change in response to community input. One probable change is that the evaluation choices will expand beyond the binary to a tenary scheme for some of the evaluation. Will modify the page once confirmed. Cheers, J. Stephen Downie
 
  
==Welcome to the Evalutron 6000==
+
===Basic Requirements===
 +
In order to use Evalutron 6000 you will need a modern web browser (e.g., Firefox, Mozilla, Safari, Internet Explorer ) that supports JavaScript (ECMAScript) and Cookies. Evalutron has been tested on Windows XP, MacOS X, RedHat Linux, and Solaris. We have found that the combination of Firefox and Flash Player to be most stable across platforms. Despite our best efforts, the use of Internet Explorer continues to generate seemingly random errors and we ask that you avoid using Internet Explorer. A decent amount of bandwidth is also advisable to minimize download times (i.e., DSL, cable modem and better).
  
In order to use the Evalutron 6000 you will need to be using a modern web browser (e.g., Firefox, Internet Explorer, Safari, Mozilla, etc) that supports JavaScript (ECMAScript) and Cookies. Evalutron has been tested on Windows XP, MacOS X, and RedHat Linux. If you are using a different platform and having trouble, please try accessing Evalutron 6000 from another machine. If you are still having difficulty, contact
+
A screen resolution of 1024 X 768 the minimal advised. Higher resolutions appear to work well too. We have noted some odd behaviour if you adjust your resolution in mid-session, so probably best to not to do this. Adjusting font size via you browser can sometimes make things look a bit tidier and does not seem to have adverse effects.
 +
 
 +
In general, if you are having trouble, please try accessing the Evalutron 6000 using another machine/platform/browser/player combination. If you are still having difficulty, please contact
  
 
mrx-com09@lists.lis.uiuc.edu.
 
mrx-com09@lists.lis.uiuc.edu.
  
 +
 +
===Getting Started===
 
When first visiting the Evalutron 6000 homepage, you will see a page similar to this (Fig. 1).  
 
When first visiting the Evalutron 6000 homepage, you will see a page similar to this (Fig. 1).  
  
[[Image:Eval6_home.png]]
+
[[Image:E6ksms_home_page_scaled.png]]
  
 
'''Figure 1. Evalutron 6000 start page.'''
 
'''Figure 1. Evalutron 6000 start page.'''
  
First you must register a new account. Click on the "Register" link on the left side of the page to create an account.
+
First you must register a new account. Click on the "Register" link (listed under Step 2) on the page to create an account.
  
The registration page is fairly straightforward (Fig. 2). Required fields are marked in blue with an asterisks. You can create any username and password you wish. Passwords must be at least 6 characters long and are case-sensitive. Before completing the registration, you must read and agree to the terms of the Informed Consent document. The evaluation, because it is using human judgements of similarity, is considered a human-subjects research project and the Evaluatron is basically a survey instrument. To indicate your consent to participate in the evaluation, check the "I Agree" checkbox below the informed consent document.
+
The registration page is fairly straightforward (Fig. 2). Required fields are marked in blue with asterisks. You can create any username and password you wish. Passwords must be at least 6 characters long and are case-sensitive. Before completing the registration, you must read and agree to the terms of the Informed Consent document. The evaluation, because it is using human judgements of similarity, is considered a human-subjects research project and the Evaluatron is basically a survey instrument. To indicate your consent to participate in the evaluation, check the "I Agree" checkbox below the informed consent document.
  
 
If you have questions about your rights as a subject in this research project, you should contact the UIUC IRB office (http://www.irb.uiuc.edu) for more information. The research protocol for this project is IRB# 07066.
 
If you have questions about your rights as a subject in this research project, you should contact the UIUC IRB office (http://www.irb.uiuc.edu) for more information. The research protocol for this project is IRB# 07066.
  
[[Image:Eval6_register.png]]
+
[[Image:E6ksms_registration_page_scaled.png]]
 +
 
 +
'''Figure 2. Evalutron 6000 registration page.'''
  
'''Figure 2. Evalutron 6000 start page.'''
 
  
After completing the Registration, the system will ask you to log-in with your newly created username and password. After logging-in, you can begin evaluation (Fig. 3). To start the evaluation process, click on the "Start Evaluation" link on the homepage. You will be presented with several options for what media player you want to use. We have invested considerable effort coding various interfaces to be compatible with the maximum number of browsers, platforms, and players. Most users should be able to use the Flash MP3 Player option -- which we've found to work the best. The Windows Media Player and Quicktime options should also work in most modern browsers.  
+
===Selecting Your Audio Player===
 +
After completing the Registration, the system will ask you to sign-in with your newly created username and password. After signing-in, you can begin evaluation (Fig. 3). To start the evaluation process, click on the "Start Evaluation" link on the homepage. You will be presented with several options for what media player you want to use. We have invested considerable effort coding various interfaces to be compatible with the maximum number of browsers, platforms, and players. Most users should be able to use the Flash MP3 Player option -- which we've found to work the best. The Windows Media Player and Quicktime options should also work in most modern browsers. We have noted that the Media Player does introduce some start-up lags when starting a new query set which we believe is caused by the way Media Player downloads the MP3 files.
  
[[Image:Eval6_start.png]]
+
[[Image:E6ksms_options_page_scaled.png]]
  
 
'''Figure 3. Audio player selection page.'''
 
'''Figure 3. Audio player selection page.'''
  
Once you have selected a player configuration, you'll see a query evaluation page (Fig. 4). This page consists of two main parts - the query player and candidate players at the top, and the candidate list below. Each song (query and candidates) has been split into three 30 second clips. The first clip is taken from the first 30 seconds of the audio file (0:00 - 0:30). The second clip is the middle 30 seconds of the song (middle-0:15 - middle+0:15). The third clip is the last 30 seconds of the song (end-0:30 - end). Clicking on these buttons will load the song into the player and begin playing. Depending on the player you are using and your browser's configuration, you should be able to seek through the 30 second clip and replay it using the player's play controls. We recommend you listen to the entire query at least once before evaluating any candidate files.
 
  
[[Image:Eval6_eval_page.png]]
+
===Evaluation Pages: Queries and Candidate Lists===
 +
Once you have selected a player configuration, you'll see a query evaluation page (Fig. 4). On this page, you will see the query player and the list of candidate players. The queries and candidates drawn from the RISM collection are fairly short (Queries #1 through #6). We recommend you listen to the entire query at least once before evaluating any candidate files.
 +
 
 +
[[Image:E6ksms_evaluation_page_scaled.png]]
  
 
'''Figure 4. Sample evaluation page.'''
 
'''Figure 4. Sample evaluation page.'''
  
Please note that the list of candidates scrolls within the page, there are more candidates than may be immediately visible on the page. Please scroll to the bottom of the candidate list to make sure you've evaluated each song.
+
Please do not be puzzled if your first query assignment is not Query #1 or if your Candidate List has candidates in seemingly random order: This is deliberate! The Evalutron 6000 is designed to build customized randomized lists and orderings for each grader to minimize fatigue and ordering effects. While you have the right to jump around in the system to do your evaluations in any order you like, we would appreciate it if you followed the suggested ordering given by the Evalutron 6000's randomizer. The "Previous Query" and "Next Query" buttons visible in Figure 4 are designed to guide you through the randomized Query List ordering.
 +
 
 +
Please note that the list of candidates is longer than the visible page (i.e., there are more candidates than may be immediately visible on the page). Please scroll to the bottom of the candidate list to make sure you have evaluated each song.
 +
 
 +
The procedure for listening to a candidate is the same as listening to a query -- click the play buttons to load the clip into the player and listen to it (Fig. 5). Once you have a feeling for how similar the candidate is to the query, click the "Not Similar," "Somewhat Similar" or "Very Similar" radio buttons to the right of the candidate. Note: you can continue to replay the query during candidate playback.
 +
 
 +
[[Image:E6ksms_select_category.png]]
 +
 
 +
'''Figure 5. Close up image of Broad Category selection buttons.'''
  
The procedure for listening to a candidate is the same as listening to a query -- click the 0:30-clip buttons to load the clip into the player and listen to it (Fig. 5). Once you have a feeling for whether or not the candidate is similar to the query, click the "Similar" or "Not Similar" radio buttons to the right of the candidate. Note: you can continue to replay the query during candidate playback.
+
Depending on how you graded the candidate, you should see the candidate box react to your grade, indicating that the vote has been logged in the database (Fig. 6). If you indicated the song was "Somewhat Similar" the box will turn yellow and state "SAVED". If you indicated the song was "Very Similar" the box will turn green, "Not Similar" the box will turn red, and state "SAVED".  
  
[[Image:Eval6_cand_detail.png]]
+
[[Image:E6ksms_select_category_saved.png]]
  
'''Figure 5. Close up image of candidate manipulation buttons.'''
+
'''Figure 6. Close up image of Broad Category selection buttons with "Somewhat Similar" selected and "SAVED" automatically.'''
  
Depending on how you graded the candidate, you should see the candidate box react to your grade, indicating that the vote has been logged in the database (Fig. 6). If you indicated the song was "Similar" the box will turn green and state "SAVED".
+
We also need each grader to assign a fine-grained score for the similarity of the candidate to the query on the scale of 0-10 (Fig. 7). More information about this process below.
  
[[Image:Eval6_sim_saved.png]]
+
[[Image:E6ksms_select_score.png]]
  
'''Figure 6. Close up image of candidate manipulation buttons with "Similar" selection "SAVED".'''
+
'''Figure 7. Close up image of the Fine Score selection scale.'''
  
If you indicated the song was "Not Similar" the box will turn red and state "SAVED" (Fig. 7).  
+
The system will automatically record the score when you let go of the scaler (Fig. 8).  
  
[[Image:Eval6_notsim_saved.png]]
+
[[Image:E6ksms_select_score_saved.png]]
  
'''Figure 7. Close up image of candidate manipulation buttons with "Not Similar" selection "SAVED".'''
+
'''Figure 8. Close up image of a Fine Score selection scale with a random score "SAVED".'''
  
You can always change your evaluation for any candidate by toggling the radio buttons. The system should reflect any changes with an appropriate color change. Once an evaluation has been made, however, it cannot be retracted, only reversed. (i.e., you cannot "unvote").
+
You can always change your evaluation for any candidate by toggling the radio buttons. You can also go back and adjust the Fine Score selection scale.  
  
When you have completed evaluating all of the candidates for a single query, you can click on the "Next Random Query" button at the bottom of the page (Fig. 8). This button will load a new query and list of candidates for you to evaluate.  
+
When you have completed evaluation all of the candidates for a single query, you can click on the "Next Query" button at the top right of the evaluation page. This button will load a new Query and associated Candidate List for you to evaluate. Using the "My Assignment" tab, you can check the list of queries and candidates that were assigned to you, and also see how far you are in completing your evaluation task (Fig. 9).
  
[[Image:Eval6_next_button.png]]
+
[[Image:E6ksms_my_assignment_scaled.png]]
  
'''Figure 8. Close up image of "Next Random Query" button which generates next set of candidates for evaluation. '''
+
'''Figure 9. My Assignment page. '''
  
You will see a list of all of the queries you have evaluated (or are evaluating) at the bottom of the page. You can return to any query by clicking on the button for that query here (Fig. 9). You can re-evaluate any candidate for any query at any time, up to the closing of the evaluation system.
+
You can also see a list of all of the queries you have been assigned at the bottom of each Candidate List page. You can return to any query by clicking on the button for that query here (Fig. 10). You can re-evaluate any candidate for any query at any time, up to the closing of the evaluation system.
  
 
[[Image:Eval6_queries_detail.png]]
 
[[Image:Eval6_queries_detail.png]]
  
'''Figure 9. Close up image of Query List buttons which allow for revisiting of completed query/candidate sets. '''
+
'''Figure 10. Close up image of Query List buttons which allow for revisiting of completed query/candidate sets. '''

Revision as of 14:46, 30 August 2006

Welcome to Evalutron 6000

Basic Requirements

In order to use Evalutron 6000 you will need a modern web browser (e.g., Firefox, Mozilla, Safari, Internet Explorer ) that supports JavaScript (ECMAScript) and Cookies. Evalutron has been tested on Windows XP, MacOS X, RedHat Linux, and Solaris. We have found that the combination of Firefox and Flash Player to be most stable across platforms. Despite our best efforts, the use of Internet Explorer continues to generate seemingly random errors and we ask that you avoid using Internet Explorer. A decent amount of bandwidth is also advisable to minimize download times (i.e., DSL, cable modem and better).

A screen resolution of 1024 X 768 the minimal advised. Higher resolutions appear to work well too. We have noted some odd behaviour if you adjust your resolution in mid-session, so probably best to not to do this. Adjusting font size via you browser can sometimes make things look a bit tidier and does not seem to have adverse effects.

In general, if you are having trouble, please try accessing the Evalutron 6000 using another machine/platform/browser/player combination. If you are still having difficulty, please contact

mrx-com09@lists.lis.uiuc.edu.


Getting Started

When first visiting the Evalutron 6000 homepage, you will see a page similar to this (Fig. 1).

File:E6ksms home page scaled.png

Figure 1. Evalutron 6000 start page.

First you must register a new account. Click on the "Register" link (listed under Step 2) on the page to create an account.

The registration page is fairly straightforward (Fig. 2). Required fields are marked in blue with asterisks. You can create any username and password you wish. Passwords must be at least 6 characters long and are case-sensitive. Before completing the registration, you must read and agree to the terms of the Informed Consent document. The evaluation, because it is using human judgements of similarity, is considered a human-subjects research project and the Evaluatron is basically a survey instrument. To indicate your consent to participate in the evaluation, check the "I Agree" checkbox below the informed consent document.

If you have questions about your rights as a subject in this research project, you should contact the UIUC IRB office (http://www.irb.uiuc.edu) for more information. The research protocol for this project is IRB# 07066.

File:E6ksms registration page scaled.png

Figure 2. Evalutron 6000 registration page.


Selecting Your Audio Player

After completing the Registration, the system will ask you to sign-in with your newly created username and password. After signing-in, you can begin evaluation (Fig. 3). To start the evaluation process, click on the "Start Evaluation" link on the homepage. You will be presented with several options for what media player you want to use. We have invested considerable effort coding various interfaces to be compatible with the maximum number of browsers, platforms, and players. Most users should be able to use the Flash MP3 Player option -- which we've found to work the best. The Windows Media Player and Quicktime options should also work in most modern browsers. We have noted that the Media Player does introduce some start-up lags when starting a new query set which we believe is caused by the way Media Player downloads the MP3 files.

File:E6ksms options page scaled.png

Figure 3. Audio player selection page.


Evaluation Pages: Queries and Candidate Lists

Once you have selected a player configuration, you'll see a query evaluation page (Fig. 4). On this page, you will see the query player and the list of candidate players. The queries and candidates drawn from the RISM collection are fairly short (Queries #1 through #6). We recommend you listen to the entire query at least once before evaluating any candidate files.

File:E6ksms evaluation page scaled.png

Figure 4. Sample evaluation page.

Please do not be puzzled if your first query assignment is not Query #1 or if your Candidate List has candidates in seemingly random order: This is deliberate! The Evalutron 6000 is designed to build customized randomized lists and orderings for each grader to minimize fatigue and ordering effects. While you have the right to jump around in the system to do your evaluations in any order you like, we would appreciate it if you followed the suggested ordering given by the Evalutron 6000's randomizer. The "Previous Query" and "Next Query" buttons visible in Figure 4 are designed to guide you through the randomized Query List ordering.

Please note that the list of candidates is longer than the visible page (i.e., there are more candidates than may be immediately visible on the page). Please scroll to the bottom of the candidate list to make sure you have evaluated each song.

The procedure for listening to a candidate is the same as listening to a query -- click the play buttons to load the clip into the player and listen to it (Fig. 5). Once you have a feeling for how similar the candidate is to the query, click the "Not Similar," "Somewhat Similar" or "Very Similar" radio buttons to the right of the candidate. Note: you can continue to replay the query during candidate playback.

File:E6ksms select category.png

Figure 5. Close up image of Broad Category selection buttons.

Depending on how you graded the candidate, you should see the candidate box react to your grade, indicating that the vote has been logged in the database (Fig. 6). If you indicated the song was "Somewhat Similar" the box will turn yellow and state "SAVED". If you indicated the song was "Very Similar" the box will turn green, "Not Similar" the box will turn red, and state "SAVED".

File:E6ksms select category saved.png

Figure 6. Close up image of Broad Category selection buttons with "Somewhat Similar" selected and "SAVED" automatically.

We also need each grader to assign a fine-grained score for the similarity of the candidate to the query on the scale of 0-10 (Fig. 7). More information about this process below.

File:E6ksms select score.png

Figure 7. Close up image of the Fine Score selection scale.

The system will automatically record the score when you let go of the scaler (Fig. 8).

File:E6ksms select score saved.png

Figure 8. Close up image of a Fine Score selection scale with a random score "SAVED".

You can always change your evaluation for any candidate by toggling the radio buttons. You can also go back and adjust the Fine Score selection scale.

When you have completed evaluation all of the candidates for a single query, you can click on the "Next Query" button at the top right of the evaluation page. This button will load a new Query and associated Candidate List for you to evaluate. Using the "My Assignment" tab, you can check the list of queries and candidates that were assigned to you, and also see how far you are in completing your evaluation task (Fig. 9).

File:E6ksms my assignment scaled.png

Figure 9. My Assignment page.

You can also see a list of all of the queries you have been assigned at the bottom of each Candidate List page. You can return to any query by clicking on the button for that query here (Fig. 10). You can re-evaluate any candidate for any query at any time, up to the closing of the evaluation system.

File:Eval6 queries detail.png

Figure 10. Close up image of Query List buttons which allow for revisiting of completed query/candidate sets.