Data Management

You can upload data through our API or by uploading the JSON/CSV/TSV/ZIP/RAR files. You can also upload more data later in Data Manager. See examples for your project in the Upload dialog.

All text resources can be included in tasks directly, and advanced hosting is not necessary.

Upload resources

Images, audio, video, and other external files must be uploaded to any hosting with the http/https access. Your JSON/CSV/TSV must contain proper http/https URLs to them.

Let’s consider an example: upload images to Amazon AWS and prepare the CSV task file.

  1. If you don’t have AWS account, make it here: https://console.aws.amazon.com
  2. Create a new S3 bucket
  3. Upload resource files to bucket via Amazon web interface or API:
  4. Create the CSV task file
  5. Go to your folder with images
$ cd /path/to/images
``` 
    
4.2 Save this script to run.sh in the current folder 

```bash
filter="*.jpg"
column_name="image"
out_file="example.csv"
root="s3.amazonaws.com/<bucket_name>/"  # insert your AWS root

echo $column_name > $out_file  # write header
for var in $(ls $filter)
do
  echo "$root$var" >> $out_file  # write rows
done

4.3 Obtain the result

$ bash run.sh  # run our script
$ cat example.csv  # show the resulting CSV file
image
s3.amazonaws.com/<bucket_name>/1.jpg
s3.amazonaws.com/<bucket_name>/2.jpg

Now you may upload this CSV file to Heartex Platform.

Management

The administrator of the project has full control of data, experts, and statistics about the labeling process. Clicking on the pencil button opens up the editor for a particular item in your dataset.

Press the «Add more data» button to add data samples into the system in JSON/CSV format.

Press the pencil button, and the editor starts to label this data sample. See the next page.

Download Results

So you have been working hard labeling your data and have accumulated a respectable amount. How do you get the data out of the application and onto your computer? Heartex provides a download function that works one of three ways depending on the state and settings of your project:

You have configuration around the configuration of aggregation and how much of meta-information shall be included in the results.

The downloaded results are JSON formatted.

We also support all operations at the API level.

Data formats

Task formats could be viewed by following this link in your browser (change <task_id> for the real task ID, e.g., 2353):

curl https://go.heartex.net/api/tasks/<task_id>/

The following format specifies a Task:

{
  "id": 2353,
  "data": {
    "my_image_url": "https://go.heartex.net/static/samples/kittens.jpg"
  },
  "accuracy": 0.0,
  "created_at": "2019-02-04T20:33:51.361394Z",
  "updated_at": "2019-02-04T20:33:51.361430Z",
  "is_labeled": false,
  "project": 2
}

Note that only "data" field is required (other ones are optional and/or automatically generated by platform) Uploading list of new tasks could use only this field:

[{
  "data": {
    "my_image_url": "https://go.heartex.net/static/samples/kittens1.jpg"
  }
}, {
  "data": {
    "my_image_url": "https://go.heartex.net/static/samples/kittens2.jpg"
}}]