Do I need WebSockets to stream?
No. Streaming uses a long-lived HTTP response, so standard HTTPS works.
Reference
Streaming chat completions send partial tokens over a single HTTP response instead of waiting for the full output.
This improves perceived latency and enables responsive chat UIs.
A streaming chat completion is a response delivered incrementally as a sequence of delta chunks rather than one final message.
curl -N https://api.abliteration.ai/v1/chat/completions \
-H "Authorization: Bearer $ABLIT_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "abliterated-model",
"messages": [{"role":"user","content":"Write a short haiku about the ocean."}],
"stream": true
}'FAQ
No. Streaming uses a long-lived HTTP response, so standard HTTPS works.
Yes. Close the connection or abort the request to stop generation early.
No. Usage is still counted by tokens generated and processed.