conversation_id (string, length 36) | dataset (string, 1 class) | first_query (string, length 5 to 1.29k) | category (string, 9 classes) | domain (string, 9 classes) | lang (string, 1 class) | is_haystack (bool, 1 class) | mode (string, 1 class) | length (int64, 8 to 12) | turns (string, length 18.6k to 152k) | fact_ledger (string, length 2 to 608) | cross_turn_checks (string, length 62 to 2.96k) | holistic_score (string, 1 class) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
b91e7d4a-1e51-4524-8ff2-c0d694c611db | seeds | I am depressed and anxious a lot of the time. I am tired of it and don't know what to do. What do you recommend for me to do to be less depressed and anxious? | general | general | en | false | reasoning | 8 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "I am depressed and anxious a lot of the time. I am tired of it and don't know what to do. What do you recommend for me to do to be less depressed and anxious?"}, {"segment_type": "answer", "text": ".\n- Final answer after .\n- No memory denial (though t... | [] | {"delayed_recall": [], "standing": [{"constraint_turn": 2, "checked_turn": 3, "type": "json_valid", "passed": false, "detail": "invalid JSON: Expecting value: line 1 column 1 (char 0)"}, {"constraint_turn": 2, "checked_turn": 4, "type": "json_valid", "passed": false, "detail": "invalid JSON: Expecting value: line 1 col... | null |
4688b46d-de20-4fed-9336-d3515c04368f | seeds | Organize these words in alphabetic order and also from shortest to longest words. -Casa -House -Weapon -Computer -Dimension -Fire | general | general | en | false | reasoning | 8 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "Organize these words in alphabetic order and also from shortest to longest words.\n\n-Casa\n\n-House\n\n-Weapon\n\n-Computer\n\n-Dimension\n\n-Fire"}, {"segment_type": "reasoning", "text": "The user wants me to organize a list of words in two ways:\n1. ... | [{"fact_id": "f1", "value": "Tier-3", "planted_turn": 2, "fact_type": "subscription_tier", "value_history": [{"turn": 2, "value": "Sapphire"}, {"turn": 2, "value": "Tier-3"}]}] | {"delayed_recall": [], "standing": [{"constraint_turn": 6, "checked_turn": 7, "type": "json_valid", "passed": false, "detail": "invalid JSON: Expecting value: line 1 column 1 (char 0)"}], "update_overwrite": [{"turn_index": 2, "fact_id": "f1", "passed": true, "detail": "current value present, no stale values"}]} | null |
3b8de115-66b9-45cd-99eb-6838bc874260 | seeds | What are the key differences between classical and operant conditioning in psychology? | general | general | en | false | reasoning | 11 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "What are the key differences between classical and operant conditioning in psychology?"}, {"segment_type": "reasoning", "text": "The user is asking about the key differences between classical and operant conditioning in psychology. This is a straightfor... | [] | {"delayed_recall": [], "standing": [], "update_overwrite": []} | null |
2dbaf751-d663-42a7-999d-07d5b19e360d | seeds | What's up doc? | general | general | en | false | reasoning | 10 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "What's up doc?"}, {"segment_type": "reasoning", "text": "The user is greeting me with a casual phrase \"What's up doc?\" which is a famous catchphrase associated with the cartoon character Bugs Bunny. I should respond in a friendly, casual manner that a... | [{"fact_id": "f1", "value": "Salta", "planted_turn": 9, "fact_type": "office_city", "value_history": [{"turn": 9, "value": "Salta"}]}] | {"delayed_recall": [], "standing": [], "update_overwrite": []} | null |
08b08ad4-d7b8-4bf0-a331-7b6106c80618 | seeds | what are the risks of using AI to create code for a professional programmer | coding | coding | en | false | reasoning | 8 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"what are the risks of(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"crimson\", \"planted_turn\": 3, \"fact_type\": \"favorite_color(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 1, \"checked_turn\": 2, \"type\": \"l(...TRUNCATED) | null |
26a6f500-63a7-4832-a7fa-962af7b6316d | seeds | "What are some unique things about the 37th largest city in Japan? Does it have anything in common w(...TRUNCATED) | general | general | en | false | reasoning | 10 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"What are some unique (...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"77364\", \"planted_turn\": 3, \"fact_type\": \"badge_number\", (...TRUNCATED) | "{\"delayed_recall\": [{\"turn_index\": 5, \"fact_id\": \"f1\", \"passed\": true, \"detail\": \"f1 l(...TRUNCATED) | null |
84aa914e-013f-4bbe-b972-a40cfb9b417e | seeds | "explain in first principles the business model behind politicians such as senators, their incentive(...TRUNCATED) | general | general | en | false | reasoning | 12 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"explain in first prin(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"Nimbus\", \"planted_turn\": 11, \"fact_type\": \"pet_name\", \"(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 2, \"checked_turn\": 3, \"type\": \"j(...TRUNCATED) | null |
311a3aec-8f87-43a9-be7c-faf57220188d | seeds | On average, how tall is a robot vacuum cleaner? | general | general | en | false | reasoning | 8 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"On average, how tall (...TRUNCATED) | [] | {"delayed_recall": [], "standing": [], "update_overwrite": []} | null |
281d6aa6-224c-45e2-be53-39b9009ca33c | seeds | can you tell me who mark seifter is? especially focusing on his game design and creation history | humanities | humanities | en | false | reasoning | 11 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"can you tell me who m(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"Salta\", \"planted_turn\": 5, \"fact_type\": \"office_city\", \(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [], \"update_overwrite\": [{\"turn_index\": 5, \"fact_id\": (...TRUNCATED) | null |
d9ecc033-6728-4b41-9b54-dfbd4d817718 | seeds | What kind of safety measures does your programming include? | general | general | en | false | reasoning | 9 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"What kind of safety m(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"8929\", \"planted_turn\": 3, \"fact_type\": \"lucky_number\", \(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 2, \"checked_turn\": 3, \"type\": \"l(...TRUNCATED) | null |