Private.txt — 240k
: The messages were cleaned by removing group chats and unknown contacts, then grouped into "chunks" of 200 tokens to serve as training prompts for the AI.
: Microsoft Excel has a known limit for .prn (space-delimited) files where formatted text is limited to 240 characters per line . 240k private.txt
The most relevant reference is a project by Edward Donner , where he fine-tuned a Large Language Model (LLM) on his own private history of 240,805 text messages to create a digital simulation of himself. : The messages were cleaned by removing group
: The data consisted of SMS, iMessage, and WhatsApp conversations with 288 people. : The data consisted of SMS, iMessage, and
The search results do not contain a specific file named "240k private.txt" or its direct contents. However, the query likely refers to a well-known project or dataset involving used for machine learning. Likely Context: The "240k Text Messages" Project
A simulation of me: fine-tuning an LLM on 240k text messages