Skip to content

Every time I run run_glue.py I get stuck in the "sock.connect(sa)" of the code at several places. Why? Is it a network problem? #26417

@LLIKKE

Description

@LLIKKE

System Info

check_min_version("4.33.0.dev0")

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

python run_glue.py
--model_name_or_path bert-base-cased
--task_name mrpc
--do_train
--do_eval
--max_seq_length 128
--per_device_train_batch_size 32
--learning_rate 2e-5
--num_train_epochs 5
--overwrite_output_dir
--save_steps 50000
--output_dir checkpoint

Expected behavior

python run_glue.py
--model_name_or_path bert-base-cased
--task_name mrpc
--do_train
--do_eval
--max_seq_length 128
--per_device_train_batch_size 32
--learning_rate 2e-5
--num_train_epochs 5
--overwrite_output_dir
--save_steps 50000
--output_dir checkpoint

I run this code, which is slow when I load it earlier, and when I force the end, it says "connection.py", line 85, in create_connection
sock.connect(sa)
"KeyboardInterrupt" but waiting for a while still works for model training, what's the reason? Is it a network problem?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions