Skip to content

Create About Golden QnA.md#623

Merged
mahajantejas merged 2 commits into
mainfrom
About-Golden-QnA.md
May 28, 2026
Merged

Create About Golden QnA.md#623
mahajantejas merged 2 commits into
mainfrom
About-Golden-QnA.md

Conversation

@tanuprasad530
Copy link
Copy Markdown
Contributor

@tanuprasad530 tanuprasad530 commented May 26, 2026

Added a documentation about what is Golden QnA, its purpose how to develop a Golden QnA and points to note while creating a Golden QnA.

Summary by CodeRabbit

Release Notes

  • Documentation
    • Added new FAQ document covering "Golden QnA Set" concepts and definitions
    • Includes guidelines for developing quality datasets with categorized example scenarios
    • Provides best practices for creating accurate answers and a review checklist for consistency and evaluation standards

Review Change Stack

Added a documentation about what is Golden QnA, its purpose how to develop a Golden QnA and points to note while creating a Golden QnA.
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented May 26, 2026

Warning

Review limit reached

@tanuprasad530, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 49 minutes and 10 seconds. Learn how PR review limits work.

Your organization has run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans include higher PR review limits than trial, open-source, and free plans. In all cases, reviews become available again over time. During sustained high-volume PR review activity, CodeRabbit may temporarily slow when the next review becomes available.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: fc8bd3dc-7820-4296-a7ed-f2f55d0054d0

📥 Commits

Reviewing files that changed from the base of the PR and between b1234fa and ac8534a.

📒 Files selected for processing (1)
  • docs/8. FAQ/About Golden QnA.md
📝 Walkthrough

Walkthrough

A new FAQ document is added to the documentation repository, defining Golden QnA Sets as curated datasets used for evaluation consistency. The document covers concept definitions, dataset development methodology with categorized examples, guidelines for creating accurate answers, and a validation checklist.

Changes

Golden QnA Set Documentation

Layer / File(s) Summary
Golden QnA Set Definition and Guidelines
docs/8. FAQ/About Golden QnA.md
Complete FAQ document covering Golden QnA Set concepts, purpose, and key characteristics; dataset development approach with categorized example scenarios; points to remember for creating accurate and evaluable answers; and a final review checklist for validation.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 A golden guide now hops through the docs,
QnA Sets shining, best evaluation unlocks,
With checklists and categories clear as can be,
The dataset consistency we all aim to see! ✨

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title 'Create About Golden QnA.md' is directly related to the main change—it adds a new FAQ documentation file. However, it is somewhat generic and file-centric rather than describing the actual content or purpose being documented.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch About-Golden-QnA.md

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 4

🧹 Nitpick comments (1)
docs/8. FAQ/About Golden QnA.md (1)

73-73: 💤 Low value

Consider more descriptive link text.

The link uses "here" as anchor text. More descriptive text like "in the development guidelines" would improve accessibility and clarity.

♻️ Proposed refactor
-- Are all categories mentioned [here](`#how-to-develop-golden-qnas`) covered adequately?
+- Are all categories mentioned [in the development guidelines](`#how-to-develop-golden-qnas`) covered adequately?
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/8`. FAQ/About Golden QnA.md at line 73, Replace the generic anchor text
"here" in the sentence "Are all categories mentioned
[here](`#how-to-develop-golden-qnas`) covered adequately?" with a more descriptive
phrase such as "in the development guidelines" (or similar) so the link reads
"Are all categories mentioned in the development guidelines covered adequately?"
to improve accessibility and clarity; update the markdown link target
[`#how-to-develop-golden-qnas`] while keeping the same fragment identifier.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@docs/8`. FAQ/About Golden QnA.md:
- Around line 61-74: Add a new section titled "Formatting Your Golden QnA
Dataset" immediately after the "Final Review Checklist" section that states the
dataset must be a CSV with each row as a question,answer pair (columns:
question, answer), includes the Golden QA CSV template link
(https://docs.google.com/spreadsheets/d/198UpOMeU53s9O-fwbIl0DIJLuD3l24jgkq74CoDfSQM/copy),
and explains the duplication factor (integer 1–5 indicating how many times
questions are repeated during evaluation); also add a short cross-reference
sentence pointing to the "AI Evaluations in Glific.md" for detailed usage
instructions so readers can both format and implement Golden QnAs in Glific.
- Line 53: Fix the typo in the table header: replace the string "<b>What to
avaoid</b>" with "<b>What to avoid</b>" in the header row that currently reads
"<b>To include</b> | <b>What to avaoid</b> | <b>Why it matters?</b>" so the
header correctly shows "What to avoid".
- Line 45: Fix the spelling in the table row that begins with "Practical
Situations" by changing the example question text from "3 monhs ki pregnancy" to
"3 months ki pregnancy"; locate the cell containing "Practical Situations |
Tests whether the chatbot..." and update the second example sentence to the
corrected spelling.
- Line 55: The table row in "About Golden QnA.md" contains an unclosed quotation
after the phrase "It depends"; update that cell by adding the missing closing
quotation mark immediately after It depends (i.e., change `“It depends` to `“It
depends”`) so the grammar and punctuation are correct for the Answers column.

---

Nitpick comments:
In `@docs/8`. FAQ/About Golden QnA.md:
- Line 73: Replace the generic anchor text "here" in the sentence "Are all
categories mentioned [here](`#how-to-develop-golden-qnas`) covered adequately?"
with a more descriptive phrase such as "in the development guidelines" (or
similar) so the link reads "Are all categories mentioned in the development
guidelines covered adequately?" to improve accessibility and clarity; update the
markdown link target [`#how-to-develop-golden-qnas`] while keeping the same
fragment identifier.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 4b4be332-3f05-4683-9147-0d27720a0c8c

📥 Commits

Reviewing files that changed from the base of the PR and between b80c617 and b1234fa.

📒 Files selected for processing (1)
  • docs/8. FAQ/About Golden QnA.md

Comment thread docs/8. FAQ/About Golden QnA.md Outdated
| <b>Category</b> | <b>Purpose</b> | <b>Example Questions</b> |
|----------|----------|----------|
| Important Information (covering the most frequently asked themes) | Tests important factual information the chatbot should know. These should form the majority of the dataset. | 1. हजार दिवस क्या है ? <br> 2. Pregnancy mein aneamia ke kya lakshan hote hai?|
| Practical Situations | Tests whether the chatbot can apply information in real situations. | 1. C-section ke baad taake lage hai toh uska dekhbhaal kaise karien?<br> 2. 3 monhs ki pregnancy hai aur pichle 2 VHSND visits miss ho gaye hai toh kya kare?|
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Fix spelling error in example question.

Line 45 contains "3 monhs ki pregnancy" which should be "3 months ki pregnancy".

✏️ Proposed fix
-| Practical Situations    | Tests whether the chatbot can apply information in real situations.     | 1. C-section ke baad taake lage hai toh uska dekhbhaal kaise karien?<br> 2. 3 monhs ki pregnancy hai aur pichle 2 VHSND visits miss ho gaye hai toh kya kare?|
+| Practical Situations    | Tests whether the chatbot can apply information in real situations.     | 1. C-section ke baad taake lage hai toh uska dekhbhaal kaise karien?<br> 2. 3 months ki pregnancy hai aur pichle 2 VHSND visits miss ho gaye hai toh kya kare?|
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
| Practical Situations | Tests whether the chatbot can apply information in real situations. | 1. C-section ke baad taake lage hai toh uska dekhbhaal kaise karien?<br> 2. 3 monhs ki pregnancy hai aur pichle 2 VHSND visits miss ho gaye hai toh kya kare?|
| Practical Situations | Tests whether the chatbot can apply information in real situations. | 1. C-section ke baad taake lage hai toh uska dekhbhaal kaise karien?<br> 2. 3 months ki pregnancy hai aur pichle 2 VHSND visits miss ho gaye hai toh kya kare?|
🧰 Tools
🪛 LanguageTool

[grammar] ~45-~45: Ensure spelling is correct
Context: ...h uska dekhbhaal kaise karien?
2. 3 monhs ki pregnancy hai aur pichle 2 VHSND vis...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/8`. FAQ/About Golden QnA.md at line 45, Fix the spelling in the table
row that begins with "Practical Situations" by changing the example question
text from "3 monhs ki pregnancy" to "3 months ki pregnancy"; locate the cell
containing "Practical Situations | Tests whether the chatbot..." and update the
second example sentence to the corrected spelling.


# Points to Remember While Creating Golden QnAs

| <b>To include</b> | <b>What to avaoid</b> | <b>Why it matters?</b> |
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Fix typo in table header.

The table header contains "What to avaoid" which should be "What to avoid".

✏️ Proposed fix
-| <b>To include</b> | <b>What to avaoid</b> | <b>Why it matters?</b> |
+| <b>To include</b> | <b>What to avoid</b> | <b>Why it matters?</b> |
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
| <b>To include</b> | <b>What to avaoid</b> | <b>Why it matters?</b> |
| <b>To include</b> | <b>What to avoid</b> | <b>Why it matters?</b> |
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/8`. FAQ/About Golden QnA.md at line 53, Fix the typo in the table
header: replace the string "<b>What to avaoid</b>" with "<b>What to avoid</b>"
in the header row that currently reads "<b>To include</b> | <b>What to
avaoid</b> | <b>Why it matters?</b>" so the header correctly shows "What to
avoid".


| <b>To include</b> | <b>What to avaoid</b> | <b>Why it matters?</b> |
|----------|----------|----------|
| Write clear, grammatically correct, specific, and confident answers | Answers with typos, broken grammar, or vague phrases like “Maybe”, “Could be”, “It depends | Golden Answers are expected to represent the ideal response. Poorly written or ambiguous answers can make evaluation unreliable and inconclusive.|
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add missing closing quotation mark.

The text ends with an unclosed quote after "It depends" which should be closed for proper grammar.

✏️ Proposed fix
-| Write clear, grammatically correct, specific, and confident answers    | Answers with typos, broken grammar, or vague phrases like "Maybe", "Could be", "It depends     | Golden Answers are expected to represent the ideal response. Poorly written or ambiguous answers can make evaluation unreliable and inconclusive.|
+| Write clear, grammatically correct, specific, and confident answers    | Answers with typos, broken grammar, or vague phrases like "Maybe", "Could be", "It depends"     | Golden Answers are expected to represent the ideal response. Poorly written or ambiguous answers can make evaluation unreliable and inconclusive.|
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
| Write clear, grammatically correct, specific, and confident answers | Answers with typos, broken grammar, or vague phrases like Maybe”, “Could be”, “It depends | Golden Answers are expected to represent the ideal response. Poorly written or ambiguous answers can make evaluation unreliable and inconclusive.|
| Write clear, grammatically correct, specific, and confident answers | Answers with typos, broken grammar, or vague phrases like "Maybe", "Could be", "It depends" | Golden Answers are expected to represent the ideal response. Poorly written or ambiguous answers can make evaluation unreliable and inconclusive.|
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/8`. FAQ/About Golden QnA.md at line 55, The table row in "About Golden
QnA.md" contains an unclosed quotation after the phrase "It depends"; update
that cell by adding the missing closing quotation mark immediately after It
depends (i.e., change `“It depends` to `“It depends”`) so the grammar and
punctuation are correct for the Answers column.

Comment on lines +61 to +74
# Final Review Checklist

Refer to the checklist below before finalizing the Golden QnA set.

- Is the answer factually correct?
- Is the answer grammatically correct?
- Does the question sound natural?
- Does the answer follow the prompt instructions?
- Is the answer clear and unambiguous?
- Is only one intent/category being tested?
- Is the fallback response consistent?
- Is the category correctly assigned?
- Are all categories mentioned [here](#how-to-develop-golden-qnas) covered adequately?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Add section explaining CSV format requirements and duplication factor.

This document provides excellent conceptual guidance for creating Golden QnAs, but it's missing critical technical information about how to format and use the dataset in Glific. Based on the upstream contract in AI Evaluations in Glific.md, users need to know:

  1. The Golden QA dataset must be a CSV file
  2. The CSV format: question, answer with one pair per row
  3. What the duplication factor is (number of times questions are repeated during evaluation, allowed values 1-5)
  4. Link to the Golden QA CSV template

Without this information, users who read this FAQ won't know how to actually implement their Golden QnAs in Glific.

Suggested addition:

Consider adding a new section after the "Final Review Checklist" titled "Formatting Your Golden QnA Dataset" that covers:

  • CSV file format requirements
  • Column structure (question, answer)
  • Link to the Golden QA CSV template
  • Brief explanation of duplication factor and its allowed values (1-5)
  • Cross-reference to the AI Evaluations documentation for detailed usage instructions

This bridges the gap between conceptual guidance (what makes good Golden QnAs) and practical implementation (how to format them for use in Glific).

🧰 Tools
🪛 LanguageTool

[style] ~71-~71: Three successive sentences begin with the same word. Consider rewording the sentence or use a thesaurus to find a synonym.
Context: ...nly one intent/category being tested? - Is the fallback response consistent? - Is ...

(ENGLISH_WORD_REPEAT_BEGINNING_RULE)


[style] ~72-~72: Three successive sentences begin with the same word. Consider rewording the sentence or use a thesaurus to find a synonym.
Context: ... Is the fallback response consistent? - Is the category correctly assigned? - Are ...

(ENGLISH_WORD_REPEAT_BEGINNING_RULE)

🪛 markdownlint-cli2 (0.22.1)

[warning] 73-73: Link text should be descriptive

(MD059, descriptive-link-text)

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/8`. FAQ/About Golden QnA.md around lines 61 - 74, Add a new section
titled "Formatting Your Golden QnA Dataset" immediately after the "Final Review
Checklist" section that states the dataset must be a CSV with each row as a
question,answer pair (columns: question, answer), includes the Golden QA CSV
template link
(https://docs.google.com/spreadsheets/d/198UpOMeU53s9O-fwbIl0DIJLuD3l24jgkq74CoDfSQM/copy),
and explains the duplication factor (integer 1–5 indicating how many times
questions are repeated during evaluation); also add a short cross-reference
sentence pointing to the "AI Evaluations in Glific.md" for detailed usage
instructions so readers can both format and implement Golden QnAs in Glific.

added closing `<br/>` tags
@github-actions
Copy link
Copy Markdown

@github-actions github-actions Bot temporarily deployed to pull request May 26, 2026 15:03 Inactive
@mahajantejas mahajantejas merged commit 561724c into main May 28, 2026
7 checks passed
@mahajantejas mahajantejas deleted the About-Golden-QnA.md branch May 28, 2026 04:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants