CS7IS3 Assignment 2 - Evaluation & Leaderboard Guide

This guide shows you how to run the evaluation workflow and automatically submit your MAP scores to the leaderboard.

📋 Required Files

Before running the evaluation, ensure your repository has the files and directories listed below:

Required Files (Root Directory):

  • ✅ pom.xml - Maven build configuration
  • ✅ topics - Topics file containing the search queries
  • ✅ qrels.assignment2.part1 - Relevance judgments used by the evaluator

Required Files (Tools Directory):

  • ✅ tools/evaluate.py - Python script for evaluating search results

Required Files (GitHub Workflow):

  • ✅ .github/workflows/evaluation.yml - Workflow that builds, evaluates, and submits your scores

Required Directory:

  • ✅ Assignment Two/ - Dataset directory containing:
    • fbis/ - FBIS documents
    • fr94/ - Federal Register documents (subdirectories 01-12)
    • ft/ - Financial Times documents (subdirectories ft911-ft944)
    • latimes/ - LA Times documents
    • dtds/ - DTD files for document parsing
      • fbisdtd.dtd
      • fr94dtd
      • ftdtd
      • latimesdtd.dtd
    • 📥 Download Assignment Two dataset

💡 Tip: After downloading, extract the Assignment Two directory to your repository root. For other files, place them in the exact locations shown in the File Structure Reference section below.
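
Once the dataset is extracted, your indexer has to visit every document file across all four collections while ignoring dtds/. A minimal traversal sketch, assuming plain java.nio file walking (the skip rule and the println stand in for your own parsing and indexing logic):

    import java.io.IOException;
    import java.nio.file.*;

    public class DatasetWalker {
        // Visit every regular file under "Assignment Two/", skipping the DTD
        // folder, and hand each file to your own parser/indexer.
        public static void main(String[] args) throws IOException {
            Path root = Paths.get("Assignment Two");
            try (var files = Files.walk(root)) {
                files.filter(Files::isRegularFile)
                     .filter(p -> !p.startsWith(root.resolve("dtds")))
                     .forEach(p -> System.out.println("Would index: " + p));
            }
        }
    }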

Your Java Source Code:

  • ✅ src/main/java/App.java
  • ✅ src/main/java/Indexer.java
  • ✅ src/main/java/Searcher.java
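
If your engine is built on Apache Lucene (a common choice for this assignment, though this guide does not mandate it), the heart of Indexer.java is an IndexWriter fed one Document per parsed article. Below is a minimal sketch indexing a single hand-made document; the field names, analyzer, and index path are assumptions, not requirements:

    import java.nio.file.Paths;
    import org.apache.lucene.analysis.en.EnglishAnalyzer;
    import org.apache.lucene.document.*;
    import org.apache.lucene.index.IndexWriter;
    import org.apache.lucene.index.IndexWriterConfig;
    import org.apache.lucene.store.FSDirectory;

    public class IndexerSketch {
        public static void main(String[] args) throws Exception {
            IndexWriterConfig config = new IndexWriterConfig(new EnglishAnalyzer());
            config.setOpenMode(IndexWriterConfig.OpenMode.CREATE); // rebuild the index on each run
            try (IndexWriter writer = new IndexWriter(FSDirectory.open(Paths.get("index")), config)) {
                Document doc = new Document();
                // Store the docno so the searcher can print it into the run file;
                // index (but don't store) the searchable text.
                doc.add(new StringField("docno", "FBIS3-10082", Field.Store.YES));
                doc.add(new TextField("text", "example article body ...", Field.Store.NO));
                writer.addDocument(doc);
            }
        }
    }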

Required for leaderboard submission:

  • GitHub Secrets: LEADERBOARD_API_URL and LEADERBOARD_API_TOKEN
  • Optional: TEAM_NAME - Your team name (if not set, the repository name will be used)
  • Optional: TEAM_MEMBERS - Names of team members, comma-separated (e.g., "John Doe, Jane Smith")

🚀 Quick Start

Step 1: Configure GitHub Secrets (One-time setup)

To submit scores to the leaderboard, add these secrets to your repository:

  1. Go to your repository → Settings → Secrets and variables → Actions

  2. Click "New repository secret" and add:

    Secret 1:

    • Name: LEADERBOARD_API_URL
    • Value: https://leaderboard.qrameez.in

    Secret 2:

    • Name: LEADERBOARD_API_TOKEN
    • Value: (provided by your instructor)

    Secret 3 (Optional):

    • Name: TEAM_NAME
    • Value: Your team name (e.g., "Team Lucene", "Query Rangers")
    • If not set, your repository name will be used as the team name

    Secret 4 (Optional):

    • Name: TEAM_MEMBERS
    • Value: Names of team members, comma-separated (e.g., "John Doe, Jane Smith, Bob Johnson")
    • If not set, your GitHub username will be used

⚠️ Note: Without LEADERBOARD_API_URL and LEADERBOARD_API_TOKEN, the evaluation will still run but won't submit to the leaderboard.

Step 2: Run the Evaluation

The evaluation runs automatically when you push code to your repository. You can also trigger it manually:

  1. Go to Actions tab
  2. Select "CS7IS3 Assignment 2 - Search Engine Evaluation"
  3. Click "Run workflow" → "Run workflow"

Step 3: Check Results

  1. View workflow output:

    • Go to Actions tab → Click the latest workflow run
    • Check the "Submit metrics to leaderboard" step (should show ✅)
    • View detailed metrics in the workflow summary
  2. View leaderboard:

    • Visit: https://leaderboard.qrameez.in
    • Find your team name, team members, and MAP score

📊 What the Evaluation Does

The workflow automatically:

  1. Validates your project structure (checks for required files)
  2. Builds your project (mvn clean package)
  3. Indexes documents from Assignment Two/ directory
  4. Searches all topics from the topics file and writes runs/student.run (format sketched below)
  5. Evaluates results using qrels.assignment2.part1
  6. Submits scores to leaderboard (if secrets are configured)
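
Steps 4 and 5 hinge on the run file. TREC-style evaluators conventionally read six whitespace-separated columns per retrieved document: topic, the literal Q0, docno, rank, score, and a run tag. Assuming tools/evaluate.py follows this convention, a minimal sketch of emitting one line (topic number and docno are illustrative):

    import java.io.IOException;
    import java.io.PrintWriter;
    import java.nio.file.*;

    public class RunWriterSketch {
        public static void main(String[] args) throws IOException {
            Files.createDirectories(Paths.get("runs"));
            try (PrintWriter out = new PrintWriter(Files.newBufferedWriter(Paths.get("runs/student.run")))) {
                // One line per retrieved document: topic Q0 docno rank score tag
                out.printf("%s Q0 %s %d %.4f %s%n", "401", "FBIS3-10082", 1, 12.34, "student");
            }
        }
    }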

πŸ“ Understanding Your Scores

The evaluation calculates these metrics:

  • MAP (Mean Average Precision) - Primary ranking metric (0.0 to 1.0, higher is better)
  • P@5 - Precision at 5 (fraction of top 5 results that are relevant)

The leaderboard ranks by MAP score.
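
To make these concrete, the sketch below scores a single topic by hand. Average Precision sums the precision at each relevant hit and divides by the number of judged-relevant documents; MAP is that value averaged over all topics. Document IDs are illustrative, and tools/evaluate.py remains the source of truth:

    import java.util.List;
    import java.util.Set;

    public class MetricsDemo {
        public static void main(String[] args) {
            // Ranked docnos returned for one topic (best first) and the
            // judged-relevant set for that topic; both illustrative.
            List<String> ranked = List.of("d1", "d2", "d3", "d4", "d5", "d6");
            Set<String> relevant = Set.of("d1", "d3", "d6", "d9");

            double sumPrecision = 0.0;
            int hits = 0;
            for (int i = 0; i < ranked.size(); i++) {
                if (relevant.contains(ranked.get(i))) {
                    hits++;
                    sumPrecision += (double) hits / (i + 1); // precision at this relevant hit
                }
            }
            double ap = sumPrecision / relevant.size(); // (1/1 + 2/3 + 3/6) / 4 = 0.542
            long relInTop5 = ranked.subList(0, 5).stream().filter(relevant::contains).count();
            System.out.printf("AP = %.3f, P@5 = %.1f%n", ap, relInTop5 / 5.0);
        }
    }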

🔄 Updating Your Score

Every time you:

  • Push new code
  • Create a pull request
  • Manually trigger the workflow

Your leaderboard entry automatically updates with the latest scores. No extra steps needed!

πŸ› Troubleshooting

Workflow fails at "Build and Test Search Engine"

Check:

  • ✅ pom.xml exists in root directory
  • ✅ Java source files are in src/main/java/
  • ✅ topics file exists in root directory
  • ✅ Assignment Two/ directory exists with dataset files

Fix: Add missing files and push again.

Workflow fails at "Evaluate Results"

Check:

  • ✅ tools/evaluate.py exists in your repository
  • ✅ qrels.assignment2.part1 file exists in root directory
  • ✅ runs/student.run file was generated (check previous step logs)

Fix: Ensure tools/evaluate.py is present and the search step completed successfully.
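
For reference, a well-formed runs/student.run line follows the six-column TREC format sketched earlier, e.g. 401 Q0 FBIS3-10082 1 12.3400 student (topic, Q0, docno, rank, score, run tag; the values are illustrative). An empty or malformed run file is a common cause of this failure.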

"Submit metrics to leaderboard" step is skipped

Problem: Step shows as gray (skipped)

Solution:

  • Go to Settings → Secrets and variables → Actions
  • Verify both LEADERBOARD_API_URL and LEADERBOARD_API_TOKEN are set
  • Ensure URL is exactly: https://leaderboard.qrameez.in (no trailing slash)

Scores show 0.0 on leaderboard

Possible causes:

  • Search engine didn't produce results
  • runs/student.run file is empty
  • Build or search step failed

Solution:

  • Check workflow logs in the "Build and Test Search Engine" step
  • Verify your search engine produces output
  • Ensure evaluation completed successfully

Scores not appearing on leaderboard

Check:

  1. Did the "Submit metrics to leaderboard" step succeed? (✅ green checkmark)
  2. Wait 10-30 seconds and refresh the leaderboard page
  3. Verify secrets are correct

If still not working:

  • Check workflow logs for error messages
  • Verify API URL in secrets matches: https://leaderboard.qrameez.in

📚 File Structure Reference

Your repository should look like this:

your-repo/
├── pom.xml
├── topics
├── qrels.assignment2.part1
├── src/
│   └── main/
│       └── java/
│           ├── App.java
│           ├── Indexer.java
│           └── Searcher.java
├── Assignment Two/
│   ├── fbis/
│   ├── fr94/
│   │   ├── 01/
│   │   ├── 02/
│   │   └── ... (subdirectories 03-12)
│   ├── ft/
│   │   ├── ft911/
│   │   ├── ft921/
│   │   └── ... (other ft subdirectories)
│   ├── latimes/
│   └── dtds/
│       ├── fbisdtd.dtd
│       ├── fr94dtd
│       ├── ftdtd
│       └── latimesdtd.dtd
├── tools/
│   └── evaluate.py
└── .github/
    └── workflows/
        └── evaluation.yml

🎯 Best Practices

  1. Test locally first:

    mvn clean package
    

    Fix build errors before pushing.

  2. Check workflow logs:

    • Always review the Actions tab after pushing
    • Look for warnings or errors in each step
  3. Keep secrets secure:

    • Never commit secrets to your code
    • Never share your LEADERBOARD_API_TOKEN
  4. Push frequently:

    • The leaderboard shows your latest successful evaluation
    • Push after each improvement to update your score

🔗 Quick Reference

  • Leaderboard: https://leaderboard.qrameez.in
  • Workflow: .github/workflows/evaluation.yml
  • Secrets: Settings → Secrets and variables → Actions
  • Actions Tab: https://github.com/YOUR_USERNAME/YOUR_REPO/actions

📞 Contact

If you have any questions or encounter issues, please contact:

Rameez Qureshi
📧 moquresh@tcd.ie


Good luck with your search engine implementation! 🚀