Dataset Builder File Types
CSV vs. JSON Lines Files The dataset builder creates two files: A CSV file containing only metadata A JSON Lines file containing metadata and the textual data The textual data includes: Unigrams Bigrams Trigrams Full Text (where available) The metadata may include: Column Name Description id a unique item ID
Can I download a dataset I created in the builder?
Download from your dashboard There is a download link in your Dashboard with options to download the metadata only (.csv) or Metadata with n-grams (.jsonl). What's the difference between the JSON-L dataset file and metadata CSV? Download from The Jupyter Notebook If you have used the tdm_client to pull