Question 1

Which statement about eval prompts is most accurate?

Accepted Answer

Prompts that constrain LLM judge output. This is the most accurate statement: constraining what the judge may return improves judge consistency. The competing choices sound plausible, but none of them names that constraint on the judge's output.

Question 2

How are eval prompts best characterized?

Accepted Answer

Prompts that constrain LLM judge output. Eval prompts are best characterized by the limits they place on the judge's reply, and those limits improve judge consistency. The competing choices sound plausible, but they leave out the constraint on judge output.
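As a minimal sketch of what constraining judge output can look like, the Python below builds a judge prompt that asks for JSON only and rejects any reply that does not parse to an integer score from 1 to 5. The prompt wording, the 1-5 scale, and the names build_judge_prompt and parse_judge_reply are illustrative assumptions, not part of any particular framework.

```python
import json
from typing import Optional


def build_judge_prompt(context: str, answer: str) -> str:
    """Build a judge prompt that allows exactly one output shape (assumed format)."""
    return (
        "Rate how well the ANSWER is supported by the CONTEXT.\n"
        'Reply with JSON only: {"score": <integer 1-5>, "reason": "<one sentence>"}\n'
        f"CONTEXT: {context}\n"
        f"ANSWER: {answer}\n"
    )


def parse_judge_reply(reply: str) -> Optional[int]:
    """Return the score if the reply obeys the constraint, otherwise None."""
    try:
        data = json.loads(reply)
    except json.JSONDecodeError:
        return None
    if not isinstance(data, dict):
        return None
    score = data.get("score")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(score, int) and not isinstance(score, bool) and 1 <= score <= 5:
        return score
    return None
```

Replies that fail to parse can be retried or counted as judge errors, which keeps free-form judge text out of the aggregated scores.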

Question 3

Which option best describes citation accuracy?

Accepted Answer

Cited spans actually support the claim. Citation accuracy targets hallucinated citations, so the test is whether each cited span backs the claim it is attached to. The competing choices sound plausible, but they do not require that support.

Question 4

What is the primary purpose of citation accuracy?

Accepted Answer

Cited spans actually support the claim. The purpose of measuring citation accuracy is to catch hallucinated citations, which means checking that each cited span supports its claim. The competing choices sound plausible, but they do not require that support.

Question 5

Which statement about citation accuracy is most accurate?

Accepted Answer

Cited spans actually support the claim. This is the most accurate statement because the metric targets hallucinated citations. The competing choices sound plausible, but they do not require the cited span to support the claim.

Question 6

How is citation accuracy best characterized?

Accepted Answer

Cited spans actually support the claim. Citation accuracy is characterized by this support relationship, which is what lets it target hallucinated citations. The competing choices sound plausible, but they do not require the cited span to support the claim.
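One way to picture the metric is as the fraction of (claim, cited span) pairs in which the span supports the claim. The sketch below assumes a pluggable supports function. The default token-overlap check is a deliberately crude stand-in (it ignores negation and paraphrase) and would normally be replaced by an entailment model or an LLM judge. All names and the 0.6 threshold are hypothetical.

```python
from typing import Callable, List, Tuple


def token_overlap_supports(claim: str, span: str, threshold: float = 0.6) -> bool:
    """Crude stand-in: share of claim tokens that also appear in the span."""
    claim_tokens = set(claim.lower().split())
    span_tokens = set(span.lower().split())
    if not claim_tokens:
        return False
    return len(claim_tokens & span_tokens) / len(claim_tokens) >= threshold


def citation_accuracy(
    pairs: List[Tuple[str, str]],
    supports: Callable[[str, str], bool] = token_overlap_supports,
) -> float:
    """Fraction of (claim, cited_span) pairs where the span supports the claim."""
    if not pairs:
        return 0.0
    supported = sum(1 for claim, span in pairs if supports(claim, span))
    return supported / len(pairs)


span = "Ops notes: the cache is flushed nightly at 02:00"
pairs = [
    ("the cache is flushed nightly", span),      # supported by the span
    ("latency doubled after the upgrade", span),  # citation does not back the claim
]
accuracy = citation_accuracy(pairs)
```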

Question 7

Which option best describes retrieval ablation?

Accepted Answer

Compare with/without retrieval. Retrieval ablation runs the system with and without retrieval, which quantifies the value retrieval adds. The competing choices sound plausible, but they omit the with/without comparison.

Question 8

What is the primary purpose of retrieval ablation?

Accepted Answer

Compare with/without retrieval. The purpose of the ablation is to quantify retrieval value, and only a with/without comparison isolates it. The competing choices sound plausible, but they omit that comparison.

Question 9

Which statement about retrieval ablation is most accurate?

Accepted Answer

Compare with/without retrieval. This is the most accurate statement because the comparison quantifies retrieval value. The competing choices sound plausible, but they omit the with/without comparison.

Question 10

How is retrieval ablation best characterized?

Accepted Answer

Compare with/without retrieval. Retrieval ablation is characterized by this paired comparison, which quantifies retrieval value. The competing choices sound plausible, but they omit the with/without comparison.
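The with/without comparison can be sketched as a small harness that answers each question twice, once with retrieved context and once with an empty context, and reports the difference in mean score. The callables retrieve_fn, answer_fn, and score_fn are hypothetical placeholders for a real retriever, generator, and metric. The stubs at the bottom exist only to make the example runnable.

```python
from typing import Callable, Dict, List


def retrieval_ablation(
    questions: List[str],
    retrieve_fn: Callable[[str], List[str]],
    answer_fn: Callable[[str, List[str]], str],
    score_fn: Callable[[str, str], float],
) -> Dict[str, float]:
    """Score the same questions with and without retrieval and report the delta."""
    if not questions:
        raise ValueError("need at least one question")
    with_scores, without_scores = [], []
    for q in questions:
        with_scores.append(score_fn(q, answer_fn(q, retrieve_fn(q))))
        without_scores.append(score_fn(q, answer_fn(q, [])))  # ablated run
    mean_with = sum(with_scores) / len(questions)
    mean_without = sum(without_scores) / len(questions)
    return {"with": mean_with, "without": mean_without, "delta": mean_with - mean_without}


# Stub components, assumptions for illustration only.
gold = {"q1": "a", "q2": "b"}
result = retrieval_ablation(
    questions=list(gold),
    retrieve_fn=lambda q: [gold[q]],
    answer_fn=lambda q, ctx: ctx[0] if ctx else "unknown",
    score_fn=lambda q, ans: 1.0 if ans == gold[q] else 0.0,
)
```

A delta near zero suggests the generator is answering from its own knowledge and retrieval adds little on that question set.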

Question 11

Which option best describes context window pressure?

Accepted Answer

Too many chunks crowding the prompt. Context window pressure arises when retrieved chunks crowd the prompt, and it hurts answer quality. The competing choices sound plausible, but they do not describe an overfilled prompt.

Question 12

What is the primary purpose of context window pressure?

Accepted Answer

Too many chunks crowding the prompt. Context window pressure names a problem to watch for, namely an overfilled prompt that hurts answer quality. The competing choices sound plausible, but they do not describe an overfilled prompt.

Question 13

Which statement about context window pressure is most accurate?

Accepted Answer

Too many chunks crowding the prompt. This is the most accurate statement because crowding the prompt with chunks hurts answer quality. The competing choices sound plausible, but they do not describe an overfilled prompt.

Question 14

How is context window pressure best characterized?

Accepted Answer

Too many chunks crowding the prompt. Context window pressure is characterized by that crowding and by the drop in answer quality it causes. The competing choices sound plausible, but they do not describe an overfilled prompt.
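A common way to relieve this pressure is to cap how much retrieved text enters the prompt. The sketch below keeps ranked chunks, best first, until a token budget is reached. Counting whitespace-separated words is an assumption standing in for a real tokenizer, and fit_chunks_to_budget is a hypothetical helper name.

```python
from typing import List


def fit_chunks_to_budget(ranked_chunks: List[str], max_tokens: int) -> List[str]:
    """Keep the highest-ranked chunks that fit within max_tokens.

    Token cost is approximated by whitespace word count (an assumption).
    """
    kept: List[str] = []
    used = 0
    for chunk in ranked_chunks:
        cost = len(chunk.split())
        if used + cost > max_tokens:
            break  # stop at the first chunk that would overflow the budget
        kept.append(chunk)
        used += cost
    return kept


chunks = ["a b c", "d e", "f g h i"]
kept = fit_chunks_to_budget(chunks, max_tokens=5)
```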

Question 15

Which option best describes chunk size tuning?

Accepted Answer

Optimizing chunk length for retrieval/answer. Chunk size tuning adjusts chunk length because length affects both recall and noise. The competing choices sound plausible, but they do not involve optimizing chunk length.

Question 16

What is the primary purpose of chunk size tuning?

Accepted Answer

Optimizing chunk length for retrieval/answer. The purpose of chunk size tuning is to find a length that balances recall against noise. The competing choices sound plausible, but they do not involve optimizing chunk length.

Question 17

Which statement about chunk size tuning is most accurate?

Accepted Answer

Optimizing chunk length for retrieval/answer. This is the most accurate statement because chunk length affects recall and noise. The competing choices sound plausible, but they do not involve optimizing chunk length.

Question 18

How is chunk size tuning best characterized?

Accepted Answer

Optimizing chunk length for retrieval/answer. Chunk size tuning is characterized by this optimization, which trades recall against noise. The competing choices sound plausible, but they do not involve optimizing chunk length.
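To make the tuning knob concrete, here is a minimal word-based chunker with a size and an overlap parameter. Sweeping size over a few values and re-running the retrieval eval for each is one way to tune it. Word-based splitting and the name chunk_words are assumptions. Real pipelines often split on tokens, sentences, or document structure instead.

```python
from typing import List


def chunk_words(text: str, size: int, overlap: int = 0) -> List[str]:
    """Split text into chunks of up to `size` words, sharing `overlap` words."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("require size > 0 and 0 <= overlap < size")
    words = text.split()
    if not words:
        return []
    step = size - overlap
    # Stop once the remaining words are already covered by the previous chunk.
    last_start = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, last_start, step)]


text = " ".join(f"w{i}" for i in range(10))
small = chunk_words(text, size=4, overlap=1)
chunk_counts = {size: len(chunk_words(text, size)) for size in (2, 4, 8)}
```

Smaller chunks give the retriever more precise targets but can cut context apart, while larger chunks keep context together at the cost of more noise per retrieved chunk.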

Question 19

Which option best describes hybrid search?

Accepted Answer

Combine BM25 and dense retrieval. Hybrid search merges lexical (BM25) and dense results, which often improves recall. The competing choices sound plausible, but they do not combine the two retrieval methods.

Question 20

What is the primary purpose of hybrid search?

Accepted Answer

Combine BM25 and dense retrieval. The purpose of hybrid search is to improve recall, which combining the two methods often does. The competing choices sound plausible, but they do not combine the two retrieval methods.

Question 21

Which statement about hybrid search is most accurate?

Accepted Answer

Combine BM25 and dense retrieval. This is the most accurate statement because the combination often improves recall. The competing choices sound plausible, but they do not combine the two retrieval methods.

Question 22

How is hybrid search best characterized?

Accepted Answer

Combine BM25 and dense retrieval. Hybrid search is characterized by this combination, which often improves recall. The competing choices sound plausible, but they do not combine the two retrieval methods.
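The answer does not say how the two result lists are combined. One widely used option is reciprocal rank fusion (RRF), sketched below as an assumption rather than as the method this quiz has in mind. It takes ranked document IDs from each retriever and needs no score normalization. The constant k=60 is the conventional default, and the document IDs are made up.

```python
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of doc IDs: score(d) = sum of 1 / (k + rank)."""
    scores: Dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


bm25_ranking = ["d1", "d2", "d3"]   # hypothetical lexical results
dense_ranking = ["d3", "d1", "d4"]  # hypothetical embedding results
fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
```

Documents found by either retriever survive into the fused list, which is where the recall gain comes from.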

Question 23

Which option best describes reranking?

Accepted Answer

Reorder top-k with a stronger model. Reranking passes the retrieved top-k through a stronger model and reorders it, which improves top-k quality. The competing choices sound plausible, but they do not reorder the candidates with a stronger model.

Question 24

What is the primary purpose of reranking?

Accepted Answer

Reorder top-k with a stronger model. The purpose of reranking is to improve top-k quality by reordering candidates with a stronger model. The competing choices sound plausible, but they do not reorder the candidates with a stronger model.

Question 25

Which statement about reranking is most accurate?

Accepted Answer

Reorder top-k with a stronger model. This is the most accurate statement because the reordering improves top-k quality. The competing choices sound plausible, but they do not reorder the candidates with a stronger model.

Question 26

How is reranking best characterized?

Accepted Answer

Reorder top-k with a stronger model. Reranking is characterized by this second-pass reordering, which improves top-k quality. The remaining choices fail because they do not satisfy the full definition: a stronger model reordering the top-k.
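In code, reranking is a second pass over the candidates the first-stage retriever already returned. The sketch below assumes a score_fn(query, doc) callable that stands in for the stronger model (for example a cross-encoder). The word-overlap scorer used here is only a stub so the example runs, and the queries and documents are invented.

```python
from typing import Callable, List


def rerank(
    query: str,
    candidates: List[str],
    score_fn: Callable[[str, str], float],
    top_n: int,
) -> List[str]:
    """Reorder first-stage candidates by the stronger scorer and keep top_n."""
    ordered = sorted(candidates, key=lambda doc: score_fn(query, doc), reverse=True)
    return ordered[:top_n]


def overlap_score(query: str, doc: str) -> float:
    """Stub scorer (assumption): number of words shared by query and doc."""
    return float(len(set(query.lower().split()) & set(doc.lower().split())))


candidates = ["billing faq", "how to reset your password", "password policy"]
top = rerank("reset password", candidates, overlap_score, top_n=2)
```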

Question 27

Which option best describes query rewriting?

Accepted Answer

Rephrase question to improve retrieval. Query rewriting rephrases the user's question before retrieval, which boosts recall on ambiguous queries. The remaining choices fail because they do not satisfy the full definition: rephrasing aimed at better retrieval.

Question 28

What is the primary purpose of query rewriting?

Accepted Answer

Rephrase question to improve retrieval. The purpose of query rewriting is to boost recall on ambiguous queries by rephrasing the question. The remaining choices fail because they do not satisfy the full definition: rephrasing aimed at better retrieval.

Question 29

Which statement about query rewriting is most accurate?

Accepted Answer

Rephrase question to improve retrieval. This is the most accurate statement because rephrasing boosts recall on ambiguous queries. The remaining choices fail because they do not satisfy the full definition: rephrasing aimed at better retrieval.

Question 30

How is query rewriting best characterized?

Accepted Answer

Rephrase question to improve retrieval. Query rewriting is characterized by this rephrasing step, which boosts recall on ambiguous queries. The remaining choices fail because they do not satisfy the full definition: rephrasing aimed at better retrieval.

Question 31

Which option best describes ablation drift?

Accepted Answer

Eval differences over time without changes. Ablation drift is a shift in eval results over time even though the system under test has not changed, which indicates index or data drift. The remaining choices fail because they do not satisfy the full definition: results move while the configuration stays fixed.

Question 32

What is the primary purpose of ablation drift?

Accepted Answer

Eval differences over time without changes. Ablation drift is a symptom to monitor: eval results that change over time with no system change indicate index or data drift. The remaining choices fail because they do not satisfy the full definition: results move while the configuration stays fixed.

Question 33

Which statement about ablation drift is most accurate?

Accepted Answer

Eval differences over time without changes. This is the most accurate statement because such differences indicate index or data drift. The remaining choices fail because they do not satisfy the full definition: results move while the configuration stays fixed.

Question 34

How is ablation drift best characterized?

Accepted Answer

Eval differences over time without changes. Ablation drift is characterized by results that shift between runs of an unchanged system, which indicates index or data drift. The remaining choices fail because they do not satisfy the full definition: results move while the configuration stays fixed.

Question 35

Which option best describes safety eval for RAG?

Accepted Answer

Check for unsafe answers from retrieved content. Safety eval for RAG checks whether retrieved content can lead to unsafe answers, and prompt injection in documents is one such risk. The remaining choices fail because they do not satisfy the full definition: the unsafe behavior originates in retrieved content.

Question 36

What is the primary purpose of safety eval for RAG?

Accepted Answer

Check for unsafe answers from retrieved content. The purpose of safety eval for RAG is to catch unsafe answers caused by retrieved content, with prompt injection in documents as a key risk. The remaining choices fail because they do not satisfy the full definition: the unsafe behavior originates in retrieved content.

Question 37

Which statement about safety eval for RAG is most accurate?

Accepted Answer

Check for unsafe answers from retrieved content. This is the most accurate statement because retrieved documents can carry prompt injection. The remaining choices fail because they do not satisfy the full definition: the unsafe behavior originates in retrieved content.

Question 38

How is safety eval for RAG best characterized?

Accepted Answer

Check for unsafe answers from retrieved content. Safety eval for RAG is characterized by this check, since prompt injection in documents is a risk. The remaining choices fail because they do not satisfy the full definition: the unsafe behavior originates in retrieved content.

Question 39

Which option best describes offline RAG eval?

Accepted Answer

Run eval against a dataset, not production. Offline RAG eval uses a fixed dataset instead of production traffic, which makes it stable and repeatable. The remaining choices fail because they do not satisfy the full definition: evaluation on a dataset rather than in production.

Question 40

What is the primary purpose of offline RAG eval?

Accepted Answer

Run eval against a dataset, not production. The purpose of offline RAG eval is to get stable, repeatable results by evaluating against a dataset. The remaining choices fail because they do not satisfy the full definition: evaluation on a dataset rather than in production.

Question 41

Which statement about offline RAG eval is most accurate?

Accepted Answer

Run eval against a dataset, not production. This is the most accurate statement because running against a dataset keeps results stable and repeatable. The remaining choices fail because they do not satisfy the full definition: evaluation on a dataset rather than in production.

Question 42

How is offline RAG eval best characterized?

Accepted Answer

Run eval against a dataset, not production. Offline RAG eval is characterized by its use of a dataset, which keeps it stable and repeatable. The remaining choices fail because they do not satisfy the full definition: evaluation on a dataset rather than in production.

Question 43

Which option best describes online RAG eval?

Accepted Answer

Sample real traffic for eval signals. Online RAG eval samples real traffic, which captures the real distribution of queries. The remaining choices fail because they do not satisfy the full definition: signals drawn from sampled production traffic.

Question 44

What is the primary purpose of online RAG eval?

Accepted Answer

Sample real traffic for eval signals. The purpose of online RAG eval is to capture real distributions by sampling real traffic. The remaining choices fail because they do not satisfy the full definition: signals drawn from sampled production traffic.

Question 45

Which statement about online RAG eval is most accurate?

Accepted Answer

Sample real traffic for eval signals. This is the most accurate statement because sampling real traffic captures real distributions. The remaining choices fail because they do not satisfy the full definition: signals drawn from sampled production traffic.

Question 46

How is online RAG eval best characterized?

Accepted Answer

Sample real traffic for eval signals. Online RAG eval is characterized by this sampling, which captures real distributions. The remaining choices fail because they do not satisfy the full definition: signals drawn from sampled production traffic.
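Sampling real traffic can be as simple as a deterministic hash of a request ID, so the same request is always in or out of the eval sample regardless of which server handles it. The sketch below is one possible approach under that assumption. The request ID format and the name in_eval_sample are hypothetical.

```python
import hashlib


def in_eval_sample(request_id: str, rate: float) -> bool:
    """Deterministically select roughly `rate` of requests for evaluation."""
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    # Map the first 8 bytes of the hash to a number in [0, 1).
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return bucket < rate


# Roughly half of these hypothetical request IDs should be selected.
selected = [rid for rid in (f"req-{i}" for i in range(1000)) if in_eval_sample(rid, 0.5)]
```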

Question 47

Which option best describes eval cost budgets?

Accepted Answer

Caps on judge LLM spend per run. Eval cost budgets cap what each run may spend on the judge LLM, which keeps continuous eval sustainable. The remaining choices fail because they do not satisfy the full definition: a per-run cap on judge spend.

Question 48

What is the primary purpose of eval cost budgets?

Accepted Answer

Caps on judge LLM spend per run. The purpose of an eval cost budget is to make continuous eval sustainable by capping judge spend. The remaining choices fail because they do not satisfy the full definition: a per-run cap on judge spend.

Question 49

Which statement about eval cost budgets is most accurate?

Accepted Answer

Caps on judge LLM spend per run. This is the most accurate statement because capping judge spend is what makes continuous eval sustainable. The remaining choices fail because they do not satisfy the full definition: a per-run cap on judge spend.

Question 50

How are eval cost budgets best characterized?

Accepted Answer

Caps on judge LLM spend per run. Eval cost budgets are characterized by this per-run cap, which keeps continuous eval sustainable. The remaining choices fail because they do not satisfy the full definition: a per-run cap on judge spend.
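A per-run cap can be enforced with a small budget object that is consulted before every judge call. The sketch below assumes a flat, known cost per judged item. JudgeBudget, run_with_budget, and the dollar figures are hypothetical, and a real run would estimate cost from token counts and the provider's pricing.

```python
from typing import List


class JudgeBudget:
    """Tracks judge spend for one eval run against a fixed cap."""

    def __init__(self, cap_usd: float) -> None:
        if cap_usd < 0:
            raise ValueError("cap_usd must be non-negative")
        self.cap_usd = cap_usd
        self.spent_usd = 0.0

    def try_spend(self, cost_usd: float) -> bool:
        """Record the spend and return True only if it stays within the cap."""
        if self.spent_usd + cost_usd > self.cap_usd:
            return False
        self.spent_usd += cost_usd
        return True


def run_with_budget(items: List[str], cost_per_item_usd: float, budget: JudgeBudget) -> List[str]:
    """Return the items that could be judged before the budget ran out."""
    judged: List[str] = []
    for item in items:
        if not budget.try_spend(cost_per_item_usd):
            break  # cap reached: stop the run instead of overspending
        judged.append(item)  # a real run would call the judge LLM here
    return judged


budget = JudgeBudget(cap_usd=0.5)
judged = run_with_budget(["a", "b", "c"], cost_per_item_usd=0.25, budget=budget)
```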

AI RAG Evaluation MCQ Questions with Answers – Page 2 (Latest 2026)

Q51. Which statement about eval prompts is most accurate?

Q52. How are eval prompts best characterized?

Q53. Which option best describes citation accuracy?

Q54. What is the primary purpose of citation accuracy?

Q55. Which statement about citation accuracy is most accurate?

Q56. How is citation accuracy best characterized?

Q57. Which option best describes retrieval ablation?

Q58. What is the primary purpose of retrieval ablation?

Q59. Which statement about retrieval ablation is most accurate?

Q60. How is retrieval ablation best characterized?

Q61. Which option best describes context window pressure?

Q62. What is the primary purpose of context window pressure?

Q63. Which statement about context window pressure is most accurate?

Q64. How is context window pressure best characterized?

Q65. Which option best describes chunk size tuning?

Q66. What is the primary purpose of chunk size tuning?

Q67. Which statement about chunk size tuning is most accurate?

Q68. How is chunk size tuning best characterized?

Q69. Which option best describes hybrid search?

Q70. What is the primary purpose of hybrid search?

Q71. Which statement about hybrid search is most accurate?

Q72. How is hybrid search best characterized?

Q73. Which option best describes reranking?

Q74. What is the primary purpose of reranking?

Q75. Which statement about reranking is most accurate?

Q76. How is reranking best characterized?

Q77. Which option best describes query rewriting?

Q78. What is the primary purpose of query rewriting?

Q79. Which statement about query rewriting is most accurate?

Q80. How is query rewriting best characterized?

Q81. Which option best describes ablation drift?

Q82. What is the primary purpose of ablation drift?

Q83. Which statement about ablation drift is most accurate?

Q84. How is ablation drift best characterized?

Q85. Which option best describes safety eval for RAG?

Q86. What is the primary purpose of safety eval for RAG?

Q87. Which statement about safety eval for RAG is most accurate?

Q88. How is safety eval for RAG best characterized?

Q89. Which option best describes offline RAG eval?

Q90. What is the primary purpose of offline RAG eval?

Q91. Which statement about offline RAG eval is most accurate?

Q92. How is offline RAG eval best characterized?

Q93. Which option best describes online RAG eval?

Q94. What is the primary purpose of online RAG eval?

Q95. Which statement about online RAG eval is most accurate?

Q96. How is online RAG eval best characterized?

Q97. Which option best describes eval cost budgets?

Q98. What is the primary purpose of eval cost budgets?

Q99. Which statement about eval cost budgets is most accurate?

Q100. How are eval cost budgets best characterized?