Part 6: Downloading Labeled Data and/or Model

So you have been working hard labeling your data and have accumulated a respectable amount. How do you get the data out of the application and onto your computer? SMART provides a download function that works one of three ways depending on the state and settings of your project:

  1. If your project has no data labeled, then the download button does nothing and will display “No Labeled Data to Download”.

  2. If your project is not using a model or the requirements for a model to run have not yet been met (see Refilling the Batch), then the download button will display “Download Labeled Data” and output a comma separated value (.csv) file of the labeled data with the columns ID (for the unique ID of the data), Text, and Label. The data is sorted by Label.

  3. If your project has a model, then the download button will display “Download Model and Labeled Data”. This will output a zip file with:

    1. The labeled data file (see number 2)
    2. A csv with the labels and their internal ID’s assigned by the application
    3. A pickle (.pkl) file with the preprocessed version of your input data as a TFIDF matrix
    4. A pickle file with the trained classifier model
    5. A pickle file with the trained Vectorizer used to preprocess data into the TFIDF format
    6. A README with detailed descriptions of the files and sample code on how to preprocess new data and predict it with your trained model.

    annotate-downloadmodel-files

This button is available in one of two places.

  • The Projects Page:

annotate-downloadmodel-projectsview

annotate-downloadmodel-nodatabutton OR annotate-downloadmodel-databutton OR annotate-downloadmodel-modelanddatabutton