feat(rag): notify browser of LLM retry/waiting/fallback/warning and search hit count by marevol · Pull Request #3130 · codelibs/fess

marevol · 2026-05-03T03:48:20Z

Summary

Surfaces previously-invisible in-progress events from the AI Search (RAG/Chat) flow to the browser UI via new SSE event types and an extended phase-complete payload.

New SSE events

Event	Payload	When
`retry`	`{phase, operation, attempt, maxAttempts, sleepMs, cause?}`	LLM HTTP call retry (fired once per retry attempt by the LLM plugin)
`waiting`	`{phase, reason, elapsedMs, timeoutMs}`	When `streamChatWithConcurrencyControl` blocks on an exhausted permit
`fallback`	`{phase, reason, originalQuery, newQuery}`	Before re-running search with a regenerated query (`no_results` / `no_relevant_results`)
`warning`	`{phase, code, detail}`	When intent detection silently falls back due to reasoning-model token exhaustion

The existing event: phase (status: complete) payload is now extended for the search phase to include hitCount so the browser can show "N documents found" before the longer evaluate/fetch phases finish.

Changes

LlmStreamCallback — adds default no-op onRetry / onWaiting / onWarning. @FunctionalInterface preserved.
ChatPhaseCallback — adds default no-op onRetry / onWaiting / onFallback / onWarning and a payload-aware onPhaseComplete(String, Map<String,Object>) that defaults to delegating to the legacy single-arg form.
PhaseAwareStreamCallback (new) — bridges LLM-layer LlmStreamCallback events to phase-aware ChatPhaseCallback events.
ChatClient.streamChatEnhanced — wraps every LlmStreamCallback lambda with PhaseAwareStreamCallback, emits onFallback before query regeneration, includes hitCount in search phase completion, fires onWarning when intent detection falls back.
AbstractLlmClient.streamChatWithConcurrencyControl — fires onWaiting when the concurrency permit is unavailable.
IntentDetectionResult — adds isFallback() flag (additive; only fallbackSearch(...) sets true).
ChatApiManager — emits the new SSE events. New emitSseEventSafely + putIfNotNull helpers reduce boilerplate.
chat.js / chat.jsp — adds listeners for the new events and i18n labels.
fess_label*.properties — adds 6 new keys across all 17 language files (English source for non-translated languages, Japanese translations included).

Backwards compatibility

All interface additions are default no-op methods. Existing implementers (including the 3 LLM plugins) compile and behave unchanged. Plugin PRs to actually fire onRetry follow this PR — see "Companion PRs" below.

Companion PRs (must merge AFTER this one and a new SNAPSHOT is published)

codelibs/fess-llm-openai — invoke LlmStreamCallback#onRetry from executeWithRetry
codelibs/fess-llm-ollama — same
codelibs/fess-llm-gemini — same

Until those merge, the retry SSE event will not fire (rest of the events fire from this PR alone).

Test plan

mvn test — 8 touched test classes, 72 tests, 0 failures
- LlmStreamCallbackTest (1)
- ChatPhaseCallbackTest (9 — 7 preexisting preserved + 2 new)
- PhaseAwareStreamCallbackTest (4)
- AbstractLlmClientWaitingTest (2)
- ChatClientFallbackTest (1)
- ChatClientHitCountTest (2)
- ChatClientWarningTest (2)
- ChatApiManagerTest (51 — 47 preexisting preserved + 4 new)
mvn formatter:format && mvn license:format — clean (no-op)
node -c chat.js — JS syntax OK

…seComplete to ChatPhaseCallback

… callback

…ones

…lback

…atClient

…back

…t in phase complete

…unt in chat.js

… chat.js config

… (17 languages)

…hatApiManager

marevol added 18 commits May 3, 2026 11:14

feat(rag): add onRetry/onWaiting/onWarning to LlmStreamCallback

cf540c9

feat(rag): add onRetry/onWaiting/onFallback/onWarning + payload onPha…

0a973cf

…seComplete to ChatPhaseCallback

feat(rag): add PhaseAwareStreamCallback to bridge LLM events to phase…

9c8f533

… callback

test(rag): restore preexisting ChatPhaseCallback tests alongside new …

8a7fd33

…ones

refactor(rag): enforce non-null inner callback in PhaseAwareStreamCal…

369a3b0

…lback

feat(rag): wrap LlmStreamCallback with PhaseAwareStreamCallback in Ch…

a4fe5de

…atClient

feat(rag): emit onFallback before regenerated-query search restart

d7e7e50

feat(rag): include hitCount in search phase completion payload

b10d96b

style(rag): drop redundant java.util.Map FQN (Map already imported)

9df8136

feat(rag): notify onWaiting when concurrency permit unavailable

ffdcd2b

feat(rag): emit onWarning when reasoning model triggers internal fall…

a27d769

…back

style(rag): drop dead callback-null guard before onWarning

6db52c2

feat(rag): emit retry/waiting/fallback/warning SSE events and hitCoun…

6b40ba0

…t in phase complete

test(rag): cover new SSE event types in ChatApiManager

9e77051

feat(rag): handle retry/waiting/fallback/warning SSE events and hitCo…

f12b324

…unt in chat.js

feat(rag): inject retry/waiting/fallback/warning/hitCount labels into…

3da4708

… chat.js config

i18n(rag): add chat retrying/waiting/fallback/warning/hitCount labels…

85a9a96

… (17 languages)

refactor(rag): extract emitSseEventSafely + putIfNotNull helpers in C…

a869fcb

…hatApiManager

marevol added this to the 15.7.0 milestone May 3, 2026

marevol added the improvement label May 3, 2026

marevol self-assigned this May 3, 2026

marevol merged commit a0ce62c into master May 3, 2026
1 check passed

This was referenced May 3, 2026

feat: invoke LlmStreamCallback#onRetry on retry codelibs/fess-llm-ollama#19

Merged

feat: invoke LlmStreamCallback#onRetry on retry codelibs/fess-llm-openai#16

Merged

feat: invoke LlmStreamCallback#onRetry on retry codelibs/fess-llm-gemini#18

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(rag): notify browser of LLM retry/waiting/fallback/warning and search hit count#3130

feat(rag): notify browser of LLM retry/waiting/fallback/warning and search hit count#3130
marevol merged 18 commits into
masterfrom
feature/ai-chat-progress-notifications

marevol commented May 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

marevol commented May 3, 2026

Summary

New SSE events

Changes

Backwards compatibility

Companion PRs (must merge AFTER this one and a new SNAPSHOT is published)

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant