feat: support retry on broken pipe #481

rad-pat · 2024-09-25T15:19:40Z

We are experiencing many issues calling Databend through the python driver because we have Databend in Kubernetes on spot nodes. The spot nodes can be reaped at any point and when that happens, we get errors such as:

APIError: ResponseError with 1067: transport error, source: Some(tonic::transport::Error(Transport, hyper::Error(Io, Custom { kind: BrokenPipe, error: "stream closed because of a broken pipe" })))

Can we have an option to retry on such errors at the driver level? Possibly even sent in as a config param?
databend://u:p@host/db?retry_on_broken_pipe=3

The text was updated successfully, but these errors were encountered:

everpcpc · 2024-09-26T01:36:09Z

This is not a problem within server and client, but an error between servers in the cluster. It's not safe to simply retry this error with client, since we have no idea about the current query.

Maybe we could retry at server level? cc @zhang2014

zhang2014 · 2024-09-27T00:56:46Z

When only network failures occur and the server is still available, it is safe to retry between nodes. However, if the instance has already been killed, it is not possible to retry.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support retry on broken pipe #481

feat: support retry on broken pipe #481

rad-pat commented Sep 25, 2024

everpcpc commented Sep 26, 2024

zhang2014 commented Sep 27, 2024

feat: support retry on broken pipe #481

feat: support retry on broken pipe #481

Comments

rad-pat commented Sep 25, 2024

everpcpc commented Sep 26, 2024

zhang2014 commented Sep 27, 2024