Claude/add getmetadata method x va jm#5
Merged
Conversation
…tilities - CMakeLists.txt: When ANDROID_ABI is set, skip Java-based OS/arch detection and use the Android ABI directly (arm64-v8a, armeabi-v7a, etc.). Also resolve JNI headers via find_package(JNI) instead of aborting with a fatal error on Android builds. - OSInfo.java: Fix Android arm64 native library path. The Android NDK uses "arm64-v8a" as the ABI directory name (matching jniLibs convention), not "aarch64". The non-Android Linux path is unchanged. - InferenceParameters.java: Add PARAM_CHAT_TEMPLATE constant and setChatTemplate(String) method so callers can pass a custom chat template string to the inference server. - ModelParameters.java: Add isDefault(String key) utility method to check whether a parameter has been explicitly set or is still at its default value. - RerankingModelTest.java: Annotate with @ignore so the reranking tests are skipped in environments where no reranking model is available, preventing spurious CI failures. - ChatExample.java: New interactive chat example demonstrating model loading, conversation history management, and chat-based inference with the LlamaModel API. - .gitignore: Track build-android output directory. - README.md: Add download badge for the pre-built JAR. https://claude.ai/code/session_016atM3vkBsmaia7QGXKex8w
- RerankingModelTest: remove @ignore — tests should run normally. - ChatExample: add @ignore with a comment noting the model file (models/codellama-7b.Q2_K.gguf) is not available in the models directory. https://claude.ai/code/session_016atM3vkBsmaia7QGXKex8w
… generate) ChatMessage and ChatRequest don't exist in the codebase. Replace with the real API: InferenceParameters.setMessages(String, List<Pair>) for building the message history and model.generate() for streaming output, matching the pattern used in MainExample. https://claude.ai/code/session_016atM3vkBsmaia7QGXKex8w
bernardladenthin
added a commit
that referenced
this pull request
May 20, 2026
…edException (#172) Fixes SonarQube java:S2142 reported on isAndroidTermux, getHardwareName, and resolveArmArchType. Each affected catch now calls Thread.currentThread().interrupt() to restore the interrupt flag instead of silently swallowing it. Documented as deviation #5 from the vendored xerial/sqlite-jdbc source. Co-authored-by: Claude <noreply@anthropic.com>
6 tasks
vaiju1981
pushed a commit
to vaiju1981/java-llama.cpp
that referenced
this pull request
Jun 25, 2026
Re-checked all six findings raised after PR bernardladenthin#268 against the pinned upstream tts.cpp @ b9739. Two were false alarms; four were genuine. Verified upstream-faithful (NOT divergences) — added provenance comments only: - bernardladenthin#1 llama_model_n_embd_out: upstream tts.cpp:1042 uses the exact same call. Comment now notes it reads the vocoder OUTPUT embedding width, matching upstream. - bernardladenthin#2 0.25 s silence lead-in: upstream tts.cpp:1077-1080 zeroes the first 0.25 s identically. Comment notes it mirrors upstream and that our `i < audio.size()` bound is an added safety guard over upstream's fixed 24000/4. Genuine findings fixed: - bernardladenthin#3 heavy include in shared header: tts_upstream.h now includes <nlohmann/json_fwd.hpp> (forward-declares ordered_json) instead of the full <nlohmann/json.hpp>, and drops the json default argument. The single caller, tts_engine.cpp, includes the full json and passes an explicit empty object. Future includers of the shared interface no longer pull in ~25k lines of json. - bernardladenthin#4 unasserted duplicate enum: generate-tts-upstream.cmake now captures the upstream `outetts_version` enum body and pins its enumerators + order against the hand-written copy in tts_upstream.h, so a reorder/rename fails the configure instead of silently assigning different integer values across the two TUs. - bernardladenthin#5 prompt_add overload coverage: the bare `void prompt_add(` prefix de-statics all three upstream overloads but only proved >=1 existed. The generator now pins (whitespace-tolerant) both overloads the header declares, turning a future cryptic link error into a clear configure-time failure. - bernardladenthin#6 weak WAV assertion: TtsIntegrationTest now parses the RIFF/WAVE header (PCM format, mono, 24 kHz, 16-bit), checks chunk-size self-consistency, and scans the PCM payload for non-zero samples — so a near-empty or all-silent result no longer passes the way `length > 44` did. Generator regexes validated against the real b9739 source via `cmake -P` (including a negative drift control); Java test compiles; clang-format 22.1.5 and Spotless both clean. Native build not run here (sandbox proxy blocks the dependency FetchContent clones) — exercised in CI. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01QTQ8mBM9tyKkHbXVBpGwET
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
PR cloned from kherud#122
Summary
This PR adds Android build support to the CMake configuration, introduces a new chat template parameter for inference customization, adds comprehensive unit tests for the Pair utility class, includes a chat example, and improves documentation.
Key Changes
Android Build Support: Enhanced CMake configuration to detect and handle Android builds via
ANDROID_ABI, automatically setting OS name and architecture without requiring Java execution. Added fallback JNI detection for Android environments.Chat Template Parameter: Added
setChatTemplate(String)method toInferenceParametersto allow custom chat template specification during inference.Pair Class Tests: Added comprehensive
PairTestclass with 18 test cases covering:Chat Example: Added
ChatExampledemonstrating interactive multi-turn chat with message history management.Android Architecture Mapping: Fixed ARM architecture detection for Android to use
arm64-v8ainstead ofaarch64for proper NDK compatibility.Documentation: Added comprehensive
CLAUDE.mdwith project overview, build commands, architecture details, testing guidance, and key constraints.Download Badge: Added Maven artifact download link to README.
Utility Methods: Added
isDefault(String key)method toModelParametersfor checking if a parameter uses its default value.Test Maintenance: Marked
RerankingModelTestas@Ignoreto prevent failures when model files are unavailable.Build Artifacts: Updated
.gitignoreto excludebuild-androiddirectory.Implementation Details
The Android CMake changes use conditional logic to detect
ANDROID_ABIenvironment variable and bypass Java-based OS/architecture detection, which is unavailable in cross-compilation scenarios. The JNI include directory detection also includes an Android-specific fallback using CMake'sfind_package(JNI).The new chat template parameter integrates seamlessly with the existing JSON serialization pattern used by
InferenceParameters.https://claude.ai/code/session_016atM3vkBsmaia7QGXKex8w