Constants
- ANNOTATION_REQUEST_ID
- Dynamo Annotation for the request ID
Functions
- completion_response_stream
- OpenAI Completions Request Handler
- grpc_monitor_for_disconnects
- This method consumes an AsyncEngineStream and monitors it for disconnects or context cancellation.
This is the gRPC variant of monitor_for_disconnects, as that implementation has SSE-specific handling. The two should be decoupled so that monitor_for_disconnects can be reused.
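
The disconnect-monitoring pattern described for grpc_monitor_for_disconnects can be illustrated with a small synchronous, std-only sketch. This is not the crate's implementation: the real function is async and operates on an AsyncEngineStream, whereas here an iterator stands in for the stream, a std::sync::mpsc channel stands in for the client connection, and an AtomicBool stands in for the context's cancellation signal. The names StopReason and forward_until_disconnect are hypothetical and exist only for this example.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::mpsc;

/// Why forwarding stopped (hypothetical type, not part of the crate).
#[derive(Debug, PartialEq, Eq)]
pub enum StopReason {
    /// The stream was fully consumed.
    Completed,
    /// The receiving side went away, so sending failed.
    ClientDisconnected,
    /// The cancellation flag was set before the next item was sent.
    Cancelled,
}

/// Consume `stream` and forward each item to `client`.
///
/// Assumptions for this sketch: `stream` is any iterator (a stand-in for an
/// async stream), `client` is an mpsc sender (a stand-in for the response
/// channel), and `cancelled` is a shared flag (a stand-in for context
/// cancellation).
pub fn forward_until_disconnect<T, I>(
    stream: I,
    client: &mpsc::Sender<T>,
    cancelled: &AtomicBool,
) -> StopReason
where
    I: IntoIterator<Item = T>,
{
    for item in stream {
        // Check for cancellation before doing any more work.
        if cancelled.load(Ordering::SeqCst) {
            return StopReason::Cancelled;
        }
        // `send` fails only when the receiver has been dropped,
        // which this sketch treats as a client disconnect.
        if client.send(item).is_err() {
            return StopReason::ClientDisconnected;
        }
    }
    StopReason::Completed
}
```

A caller would create a channel with mpsc::channel(), hand the receiver to the consumer, and inspect the returned StopReason to decide whether upstream work should be stopped. In an async setting the same two checks would typically be expressed as a select over the stream and a cancellation future rather than a polling loop.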