Skip to content

Conversation

@Syclus123
Copy link

This release adds the following features:

  1. Support screenshots of the evaluation process
  2. Support Online_Mind2Web task evaluation
  3. Support access to gpt-4.1, o3-mini, o4-mini and other models

Tips: To run in a Linux environment without a visual interface, use the following command to start
sudo yum install -y xorg-x11-server-Xvfb
xvfb-run python batch_eval.py

@Syclus123 Syclus123 merged commit 72c256b into iMeanAI:Online-Mind2Web-eval May 21, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant