Handshake is setting DialTimeout instead of context deadline by NamanMahor · Pull Request #1709 · ClickHouse/clickhouse-go

NamanMahor · 2025-11-11T11:10:24Z

Summary

If a user provides a context.Context with a timeout, it is ignored here in conn_handshake.go even though comment is correct to set context level deadline override any read deadline but instead of deadline we are setting Dialtimeout.

Made change to handle context deadline as we are already doing in here in conn_ping.go

CLAassistant · 2025-11-11T11:10:32Z

All committers have signed the CLA.

kavirajk · 2025-11-17T14:18:06Z

@NamanMahor thanks for the PR :). I know it's a simple change. can you please add tests for this to avoid any regression in the future?

NamanMahor · 2025-11-26T07:31:21Z

@kavirajk sure will add the test. I have one question which could also be bug. we are overriding the ctx with timeout/deadline here https://github.com/ClickHouse/clickhouse-go/blob/main/clickhouse.go#L304 which is being used by ch.dial(ctx) down in the code and later in ch.dial(ctx) will call handshake with same context so effectively we still using the Dial timeout in handshake even after my PR.

kavirajk · 2025-12-16T10:20:30Z

have one question which could also be bug. we are overriding the ctx with timeout/deadline here https://github.com/ClickHouse/clickhouse-go/blob/main/clickhouse.go#L304 which is being used by ch.dial(ctx) down in the code and later in ch.dial(ctx) will call handshake with same context so effectively we still using the Dial timeout in handshake even after my PR.

The way I see it, we have ctx on public APIs like Query(ctx), Exec(ctx) which are use-passed context. And when acquiring connection from the pool, we create "new" context by setting DialTimeout and call the dial. The handshake is happened to be inside the dial method. Whatever the timeout you passed via public APIs would still be in effect after dial to be used in real queries.

i'm curious what is your use case here?. You trying to set this timeout more dynamically only on the handsake of the whole dial method? What is blocking you to use just dialTimeout for the whole dial call? including handsake?

NamanMahor · 2026-02-09T07:38:23Z

@kavirajk sorry about late response totally miss this one. I have added the test.

NamanMahor · 2026-03-03T13:32:29Z

@kavirajk can you please have a look i have added the tests.

NamanMahor · 2026-03-12T06:48:28Z

have one question which could also be bug. we are overriding the ctx with timeout/deadline here https://github.com/ClickHouse/clickhouse-go/blob/main/clickhouse.go#L304 which is being used by ch.dial(ctx) down in the code and later in ch.dial(ctx) will call handshake with same context so effectively we still using the Dial timeout in handshake even after my PR.

The way I see it, we have ctx on public APIs like Query(ctx), Exec(ctx) which are use-passed context. And when acquiring connection from the pool, we create "new" context by setting DialTimeout and call the dial. The handshake is happened to be inside the dial method. Whatever the timeout you passed via public APIs would still be in effect after dial to be used in real queries.

i'm curious what is your use case here?. You trying to set this timeout more dynamically only on the handsake of the whole dial method? What is blocking you to use just dialTimeout for the whole dial call? including handsake?

@kavirajk

At Rill, users configure ClickHouse connections through a UI where they provide the host, port, and other connection details. We commonly see two scenarios:

The user enters a hostname that is valid but not actually running ClickHouse.

The ClickHouse cluster is in a hibernated state and takes time to wake up.

To handle the hibernation case, we need to increase the read timeout so the handshake can wait long enough for the cluster to wake up. However, if we also increase the dial timeout, then the first scenario (wrong host) ends up waiting for the full dial timeout before failing, which leads to a poor user experience.

Ideally, we want the TCP dial to fail fast when the host is incorrect, while still allowing the handshake to wait longer when the cluster is waking up.

If the dial context isn’t reused for the handshake, the handshake can use the user-provided context (which has the longer timeout), and we don’t have to increase the dial timeout. This allows wrong-host cases to fail quickly while still supporting clusters that take longer to respond during handshake.

Handshake is setting DialTimeout instead of context deadline

af46489

kavirajk and others added 2 commits November 12, 2025 12:55

Merge branch 'main' into handshake-timeout-fix

bf11c2d

Merge branch 'main' into handshake-timeout-fix

666891f

Merge branch 'main' into handshake-timeout-fix

f601a1e

NamanMahor added 2 commits December 5, 2025 22:27

Merge branch 'main' into handshake-timeout-fix

b8c1057

Merge branch 'main' into handshake-timeout-fix

2b763d7

Merge branch 'main' into handshake-timeout-fix

2b12efe

NamanMahor requested review from chernser and kavirajk as code owners February 8, 2026 09:17

adding test

8fd8b28

NamanMahor added 2 commits February 17, 2026 17:36

Merge branch 'main' into handshake-timeout-fix

f540502

Merge branch 'main' into handshake-timeout-fix

96c92f0

NamanMahor added 2 commits March 9, 2026 10:49

Merge branch 'main' into handshake-timeout-fix

251fe1f

Merge branch 'main' into handshake-timeout-fix

75ae89f

NamanMahor added 2 commits March 18, 2026 11:02

Merge branch 'main' into handshake-timeout-fix

2bcd699

Merge branch 'main' into handshake-timeout-fix

8a2e9b8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handshake is setting DialTimeout instead of context deadline#1709

Handshake is setting DialTimeout instead of context deadline#1709
NamanMahor wants to merge 14 commits intoClickHouse:mainfrom
NamanMahor:handshake-timeout-fix

NamanMahor commented Nov 11, 2025

Uh oh!

CLAassistant commented Nov 11, 2025 •

edited

Loading

Uh oh!

kavirajk commented Nov 17, 2025

Uh oh!

NamanMahor commented Nov 26, 2025

Uh oh!

kavirajk commented Dec 16, 2025

Uh oh!

NamanMahor commented Feb 9, 2026

Uh oh!

NamanMahor commented Mar 3, 2026

Uh oh!

NamanMahor commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

NamanMahor commented Nov 11, 2025

Summary

Uh oh!

CLAassistant commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kavirajk commented Nov 17, 2025

Uh oh!

NamanMahor commented Nov 26, 2025

Uh oh!

kavirajk commented Dec 16, 2025

Uh oh!

NamanMahor commented Feb 9, 2026

Uh oh!

NamanMahor commented Mar 3, 2026

Uh oh!

NamanMahor commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CLAassistant commented Nov 11, 2025 •

edited

Loading