Motivation.
In the async_engine code path, we have an option to launch the engine in a separate process using Ray:

```python
parser.add_argument('--engine-use-ray',
                    action='store_true',
                    help='Use Ray to start the LLM engine in a '
                         'separate process as the server process')
```

Originally, this option made it possible to separate the server's Python overhead from the engine's main scheduler loop.
However, a few factors have made this option less used and less popular:
- Ray is an optional component and is typically not used in single-node environments.
- The serialization and RPC overhead typically offset the theoretical performance gain.
- There are other ways to isolate the server and engine (through multiprocessing, threading, etc.).
- Recently, we have been separating the server and engine using lower-overhead approaches: [Frontend] Multiprocessing for OpenAI Server with zeromq #6883
Proposed Change.
Deprecate the flag with a warning for one release.
Remove the flag in the following release, assuming no major pushback.
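The one-release deprecation step could look roughly like the following (a hypothetical sketch, not the actual vLLM patch: the flag still parses, but using it emits a `DeprecationWarning`):

```python
import argparse
import warnings

parser = argparse.ArgumentParser()
parser.add_argument(
    '--engine-use-ray',
    action='store_true',
    help='[DEPRECATED] Use Ray to start the LLM engine in a '
         'separate process as the server process')

# Simulate a user passing the deprecated flag on the command line.
args = parser.parse_args(['--engine-use-ray'])

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter('always')
    if args.engine_use_ray:
        warnings.warn(
            '--engine-use-ray is deprecated and will be removed in a '
            'future release.',
            DeprecationWarning,
            stacklevel=2)
```

Existing invocations keep working for one release while users see the warning; the subsequent release can then drop the argument entirely.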
Feedback Period.
1 week
CC List.
No response
Any Other Things.
No response