Question 1

Which option best describes a large language model?

Accepted Answer

A neural network trained on large text corpora to predict tokens.. Here, A neural network trained on large text corpora to predict tokens. is the right choice. LLMs predict next tokens. It aligns directly with what the question asks about which option best describes a large language model. A quick elimination of partially true options helps confirm it.

Question 2

What is the primary purpose of a large language model?

Accepted Answer

A neural network trained on large text corpora to predict tokens.. In this case, A neural network trained on large text corpora to predict tokens. is correct. LLMs predict next tokens. It aligns directly with what the question asks about what is the primary purpose of a large. A quick elimination of partially true options helps confirm it.

Question 3

Which statement about a large language model is most accurate?

Accepted Answer

A neural network trained on large text corpora to predict tokens.. The best option here is A neural network trained on large text corpora to predict tokens.. LLMs predict next tokens. It aligns directly with what the question asks about which statement about a large language model is. A quick elimination of partially true options helps confirm it.

Question 4

How is a large language model best characterized?

Accepted Answer

A neural network trained on large text corpora to predict tokens.. For this question, A neural network trained on large text corpora to predict tokens. is correct. LLMs predict next tokens. It aligns directly with what the question asks about how is a large language model best characterized. A quick elimination of partially true options helps confirm it.

Question 5

Which option best describes a token?

Accepted Answer

A subword unit produced by the tokenizer.. A subword unit produced by the tokenizer. is the correct answer here. Granularity varies by tokenizer. It aligns directly with what the question asks about which option best describes a token. A quick elimination of partially true options helps confirm it.

Question 6

What is the primary purpose of a token?

Accepted Answer

A subword unit produced by the tokenizer.. Here, A subword unit produced by the tokenizer. is the right choice. Granularity varies by tokenizer. This matches the core idea being tested around what is the primary purpose of a token. A quick elimination of partially true options helps confirm it.

Question 7

Which statement about a token is most accurate?

Accepted Answer

A subword unit produced by the tokenizer.. In this case, A subword unit produced by the tokenizer. is correct. Granularity varies by tokenizer. This matches the core idea being tested around which statement about a token is most accurate. A quick elimination of partially true options helps confirm it.

Question 8

How is a token best characterized?

Accepted Answer

A subword unit produced by the tokenizer.. The best option here is A subword unit produced by the tokenizer.. Granularity varies by tokenizer. This matches the core idea being tested around how is a token best characterized. A quick elimination of partially true options helps confirm it.

Question 9

Which option best describes BPE tokenization?

Accepted Answer

Byte-pair encoding that merges frequent pairs into subwords.. For this question, Byte-pair encoding that merges frequent pairs into subwords. is correct. Common in modern LLMs. This matches the core idea being tested around which option best describes bpe tokenization. A quick elimination of partially true options helps confirm it.

Question 10

What is the primary purpose of BPE tokenization?

Accepted Answer

Byte-pair encoding that merges frequent pairs into subwords.. Byte-pair encoding that merges frequent pairs into subwords. is the correct answer here. Common in modern LLMs. This matches the core idea being tested around what is the primary purpose of bpe tokenization. A quick elimination of partially true options helps confirm it.

Question 11

Which statement about BPE tokenization is most accurate?

Accepted Answer

Byte-pair encoding that merges frequent pairs into subwords.. Here, Byte-pair encoding that merges frequent pairs into subwords. is the right choice. Common in modern LLMs. That is exactly the concept behind which statement about bpe tokenization is most accurate in this context. A quick elimination of partially true options helps confirm it.

Question 12

How is BPE tokenization best characterized?

Accepted Answer

Byte-pair encoding that merges frequent pairs into subwords.. In this case, Byte-pair encoding that merges frequent pairs into subwords. is correct. Common in modern LLMs. That is exactly the concept behind how is bpe tokenization best characterized in this context. A quick elimination of partially true options helps confirm it.

Question 13

Which option best describes the context window?

Accepted Answer

Max tokens the model can attend to per call.. The best option here is Max tokens the model can attend to per call.. Bounds prompt + completion size. That is exactly the concept behind which option best describes the context window in this context. A quick elimination of partially true options helps confirm it.

Question 14

What is the primary purpose of the context window?

Accepted Answer

Max tokens the model can attend to per call.. For this question, Max tokens the model can attend to per call. is correct. Bounds prompt + completion size. That is exactly the concept behind what is the primary purpose of the context in this context. A quick elimination of partially true options helps confirm it.

Question 15

Which statement about the context window is most accurate?

Accepted Answer

Max tokens the model can attend to per call.. Max tokens the model can attend to per call. is the correct answer here. Bounds prompt + completion size. That is exactly the concept behind which statement about the context window is most in this context. A quick elimination of partially true options helps confirm it.

Question 16

How is the context window best characterized?

Accepted Answer

Max tokens the model can attend to per call.. Here, Max tokens the model can attend to per call. is the right choice. Bounds prompt + completion size. It fits the requirement in the prompt about how is the context window best characterized. A quick elimination of partially true options helps confirm it.

Question 17

Which option best describes a parameter?

Accepted Answer

A learned weight in the model.. In this case, A learned weight in the model. is correct. Counts in billions for large LLMs. It fits the requirement in the prompt about which option best describes a parameter. A quick elimination of partially true options helps confirm it.

Question 18

What is the primary purpose of a parameter?

Accepted Answer

A learned weight in the model.. The best option here is A learned weight in the model.. Counts in billions for large LLMs. It fits the requirement in the prompt about what is the primary purpose of a parameter. A quick elimination of partially true options helps confirm it.

Question 19

Which statement about a parameter is most accurate?

Accepted Answer

A learned weight in the model.. For this question, A learned weight in the model. is correct. Counts in billions for large LLMs. It fits the requirement in the prompt about which statement about a parameter is most accurate. A quick elimination of partially true options helps confirm it.

Question 20

How is a parameter best characterized?

Accepted Answer

A learned weight in the model.. A learned weight in the model. is the correct answer here. Counts in billions for large LLMs. It fits the requirement in the prompt about how is a parameter best characterized. A quick elimination of partially true options helps confirm it.

Question 21

Which option best describes a prompt?

Accepted Answer

Input text guiding the LLM's response.. Here, Input text guiding the LLM's response. is the right choice. Quality of prompt drives quality of output. This is the most accurate statement for which option best describes a prompt. A quick elimination of partially true options helps confirm it.

Question 22

What is the primary purpose of a prompt?

Accepted Answer

Input text guiding the LLM's response.. In this case, Input text guiding the LLM's response. is correct. Quality of prompt drives quality of output. This is the most accurate statement for what is the primary purpose of a prompt. A quick elimination of partially true options helps confirm it.

Question 23

Which statement about a prompt is most accurate?

Accepted Answer

Input text guiding the LLM's response.. The best option here is Input text guiding the LLM's response.. Quality of prompt drives quality of output. This is the most accurate statement for which statement about a prompt is most accurate. A quick elimination of partially true options helps confirm it.

Question 24

How is a prompt best characterized?

Accepted Answer

Input text guiding the LLM's response.. For this question, Input text guiding the LLM's response. is correct. Quality of prompt drives quality of output. This is the most accurate statement for how is a prompt best characterized. A quick elimination of partially true options helps confirm it.

Question 25

Which option best describes temperature?

Accepted Answer

Sampling parameter controlling randomness.. Sampling parameter controlling randomness. is the correct answer here. Lower = more deterministic. This is the most accurate statement for which option best describes temperature. A quick elimination of partially true options helps confirm it.

Question 26

What is the primary purpose of temperature?

Accepted Answer

Sampling parameter controlling randomness.. Here, Sampling parameter controlling randomness. is the right choice. Lower = more deterministic. It aligns directly with what the question asks about what is the primary purpose of temperature. The other options are either incomplete or contextually incorrect.

Question 27

Which statement about temperature is most accurate?

Accepted Answer

Sampling parameter controlling randomness.. In this case, Sampling parameter controlling randomness. is correct. Lower = more deterministic. It aligns directly with what the question asks about which statement about temperature is most accurate. The other options are either incomplete or contextually incorrect.

Question 28

How is temperature best characterized?

Accepted Answer

Sampling parameter controlling randomness.. The best option here is Sampling parameter controlling randomness.. Lower = more deterministic. It aligns directly with what the question asks about how is temperature best characterized. The other options are either incomplete or contextually incorrect.

Question 29

Which option best describes top-p sampling?

Accepted Answer

Sample from smallest set of tokens with cumulative prob ≥ p.. For this question, Sample from smallest set of tokens with cumulative prob ≥ p. is correct. Controls quality/diversity. It aligns directly with what the question asks about which option best describes top-p sampling. The other options are either incomplete or contextually incorrect.

Question 30

What is the primary purpose of top-p sampling?

Accepted Answer

Sample from smallest set of tokens with cumulative prob ≥ p.. Sample from smallest set of tokens with cumulative prob ≥ p. is the correct answer here. Controls quality/diversity. It aligns directly with what the question asks about what is the primary purpose of top-p sampling. The other options are either incomplete or contextually incorrect.

Question 31

Which statement about top-p sampling is most accurate?

Accepted Answer

Sample from smallest set of tokens with cumulative prob ≥ p.. Here, Sample from smallest set of tokens with cumulative prob ≥ p. is the right choice. Controls quality/diversity. This matches the core idea being tested around which statement about top-p sampling is most accurate. The other options are either incomplete or contextually incorrect.

Question 32

How is top-p sampling best characterized?

Accepted Answer

Sample from smallest set of tokens with cumulative prob ≥ p.. In this case, Sample from smallest set of tokens with cumulative prob ≥ p. is correct. Controls quality/diversity. This matches the core idea being tested around how is top-p sampling best characterized. The other options are either incomplete or contextually incorrect.

Question 33

Which option best describes top-k sampling?

Accepted Answer

Sample from the k highest-probability tokens.. The best option here is Sample from the k highest-probability tokens.. Limits low-probability tail. This matches the core idea being tested around which option best describes top-k sampling. The other options are either incomplete or contextually incorrect.

Question 34

What is the primary purpose of top-k sampling?

Accepted Answer

Sample from the k highest-probability tokens.. For this question, Sample from the k highest-probability tokens. is correct. Limits low-probability tail. This matches the core idea being tested around what is the primary purpose of top-k sampling. The other options are either incomplete or contextually incorrect.

Question 35

Which statement about top-k sampling is most accurate?

Accepted Answer

Sample from the k highest-probability tokens.. Sample from the k highest-probability tokens. is the correct answer here. Limits low-probability tail. This matches the core idea being tested around which statement about top-k sampling is most accurate. The other options are either incomplete or contextually incorrect.

Question 36

How is top-k sampling best characterized?

Accepted Answer

Sample from the k highest-probability tokens.. Here, Sample from the k highest-probability tokens. is the right choice. Limits low-probability tail. That is exactly the concept behind how is top-k sampling best characterized in this context. The other options are either incomplete or contextually incorrect.

Question 37

Which option best describes greedy decoding?

Accepted Answer

Always picking the most likely next token.. In this case, Always picking the most likely next token. is correct. Deterministic but can be repetitive. That is exactly the concept behind which option best describes greedy decoding in this context. The other options are either incomplete or contextually incorrect.

Question 38

What is the primary purpose of greedy decoding?

Accepted Answer

Always picking the most likely next token.. The best option here is Always picking the most likely next token.. Deterministic but can be repetitive. That is exactly the concept behind what is the primary purpose of greedy decoding in this context. The other options are either incomplete or contextually incorrect.

Question 39

Which statement about greedy decoding is most accurate?

Accepted Answer

Always picking the most likely next token.. For this question, Always picking the most likely next token. is correct. Deterministic but can be repetitive. That is exactly the concept behind which statement about greedy decoding is most accurate in this context. The other options are either incomplete or contextually incorrect.

Question 40

How is greedy decoding best characterized?

Accepted Answer

Always picking the most likely next token.. Always picking the most likely next token. is the correct answer here. Deterministic but can be repetitive. That is exactly the concept behind how is greedy decoding best characterized in this context. The other options are either incomplete or contextually incorrect.

Question 41

Which option best describes beam search?

Accepted Answer

Maintain top-k partial sequences and expand each.. Here, Maintain top-k partial sequences and expand each. is the right choice. Better for structured tasks. It fits the requirement in the prompt about which option best describes beam search. The other options are either incomplete or contextually incorrect.

Question 42

What is the primary purpose of beam search?

Accepted Answer

Maintain top-k partial sequences and expand each.. In this case, Maintain top-k partial sequences and expand each. is correct. Better for structured tasks. It fits the requirement in the prompt about what is the primary purpose of beam search. The other options are either incomplete or contextually incorrect.

Question 43

Which statement about beam search is most accurate?

Accepted Answer

Maintain top-k partial sequences and expand each.. The best option here is Maintain top-k partial sequences and expand each.. Better for structured tasks. It fits the requirement in the prompt about which statement about beam search is most accurate. The other options are either incomplete or contextually incorrect.

Question 44

How is beam search best characterized?

Accepted Answer

Maintain top-k partial sequences and expand each.. For this question, Maintain top-k partial sequences and expand each. is correct. Better for structured tasks. It fits the requirement in the prompt about how is beam search best characterized. The other options are either incomplete or contextually incorrect.

Question 45

Which option best describes an instruction-tuned model?

Accepted Answer

An LLM further trained to follow instructions.. An LLM further trained to follow instructions. is the correct answer here. Examples: chat-tuned models. It fits the requirement in the prompt about which option best describes an instruction-tuned model. The other options are either incomplete or contextually incorrect.

Question 46

What is the primary purpose of an instruction-tuned model?

Accepted Answer

An LLM further trained to follow instructions.. Here, An LLM further trained to follow instructions. is the right choice. Examples: chat-tuned models. This is the most accurate statement for what is the primary purpose of an instruction-tuned. The other options are either incomplete or contextually incorrect.

Question 47

Which statement about an instruction-tuned model is most accurate?

Accepted Answer

An LLM further trained to follow instructions.. In this case, An LLM further trained to follow instructions. is correct. Examples: chat-tuned models. This is the most accurate statement for which statement about an instruction-tuned model is most. The other options are either incomplete or contextually incorrect.

Question 48

How is an instruction-tuned model best characterized?

Accepted Answer

An LLM further trained to follow instructions.. The best option here is An LLM further trained to follow instructions.. Examples: chat-tuned models. This is the most accurate statement for how is an instruction-tuned model best characterized. The other options are either incomplete or contextually incorrect.

Question 49

Which option best describes RLHF?

Accepted Answer

Reinforcement learning from human feedback to align models.. For this question, Reinforcement learning from human feedback to align models. is correct. Improves helpfulness and safety. This is the most accurate statement for which option best describes rlhf. The other options are either incomplete or contextually incorrect.

Question 50

What is the primary purpose of RLHF?

Accepted Answer

Reinforcement learning from human feedback to align models.. Reinforcement learning from human feedback to align models. is the correct answer here. Improves helpfulness and safety. This is the most accurate statement for what is the primary purpose of rlhf. The other options are either incomplete or contextually incorrect.

AI LLM Basics MCQ Questions with Answers (Latest 2026)

Q1. Which option best describes a large language model?

Q2. What is the primary purpose of a large language model?

Q3. Which statement about a large language model is most accurate?

Q4. How is a large language model best characterized?

Q5. Which option best describes a token?

Q6. What is the primary purpose of a token?

Q7. Which statement about a token is most accurate?

Q8. How is a token best characterized?

Q9. Which option best describes BPE tokenization?

Q10. What is the primary purpose of BPE tokenization?

Q11. Which statement about BPE tokenization is most accurate?

Q12. How is BPE tokenization best characterized?

Q13. Which option best describes the context window?

Q14. What is the primary purpose of the context window?

Q15. Which statement about the context window is most accurate?

Q16. How is the context window best characterized?

Q17. Which option best describes a parameter?

Q18. What is the primary purpose of a parameter?

Q19. Which statement about a parameter is most accurate?

Q20. How is a parameter best characterized?

Q21. Which option best describes a prompt?

Q22. What is the primary purpose of a prompt?

Q23. Which statement about a prompt is most accurate?

Q24. How is a prompt best characterized?

Q25. Which option best describes temperature?

Q26. What is the primary purpose of temperature?

Q27. Which statement about temperature is most accurate?

Q28. How is temperature best characterized?

Q29. Which option best describes top-p sampling?

Q30. What is the primary purpose of top-p sampling?

Q31. Which statement about top-p sampling is most accurate?

Q32. How is top-p sampling best characterized?

Q33. Which option best describes top-k sampling?

Q34. What is the primary purpose of top-k sampling?

Q35. Which statement about top-k sampling is most accurate?

Q36. How is top-k sampling best characterized?

Q37. Which option best describes greedy decoding?

Q38. What is the primary purpose of greedy decoding?

Q39. Which statement about greedy decoding is most accurate?

Q40. How is greedy decoding best characterized?

Q41. Which option best describes beam search?

Q42. What is the primary purpose of beam search?

Q43. Which statement about beam search is most accurate?

Q44. How is beam search best characterized?

Q45. Which option best describes an instruction-tuned model?

Q46. What is the primary purpose of an instruction-tuned model?

Q47. Which statement about an instruction-tuned model is most accurate?

Q48. How is an instruction-tuned model best characterized?

Q49. Which option best describes RLHF?

Q50. What is the primary purpose of RLHF?