# Datasets - Running AI tasks on files Promptloop streamlines the process of applying AI models to your data. We do this with datasets. Upload a CSV or Excel file, select relevant inputs for the model to use, and launch AI tasks on hundreds to thousands of rows. Datasets are where you will store, edit, and view data that you are processing with AI tasks. **Helpful Links** - [Datasets Page](/account/datasets) - [Create a task](/account/custom/new) ## Getting Started This video overview provides a guide of how to get started and take full advantage of datasets. <VideoLink video_src='https://share.descript.com/view/y6OY0qHqVyk' image_src='https://web-public-photos.s3.amazonaws.com/datasetview.jpg'/> **Files and Versions** Datasets are where you can upload the data you are working with on the PromptLoop Platform. **Uploading** You can upload data to launch a job, using an existing AI task, or upload a dataset to filter and analyze first before running. **Run an AI task on each row** When you run an AI task on a dataset, the job will run in the background and save as a new version in the dataset when its complete. This will allow you to run tens of thousands of operations at once and freeing you up for other work. <Callout title="Limits"> If you have questions about limits, or have a task that requires a large amount of rows, reach out to the team [here](/demo). </Callout> **Reliable and Scalable** Datasets are an extremely powerful feature available to all users and one of the core capabilities of how we think about leveraging AI models efficiently. ## How to run an AI task on a spreadsheet file upload ### Step 1: Upload a file with data Datasets let you use any data table in an Excel or CSV file. Our systems automatically detect columns and let you select which you want to use. - You can select the columns that you will use as inputs for the task - this is often a single column like a website or search term - Results from your task - new columns or rows - will be added into the uploaded sheet and available to download as a new file <Image src='https://web-public-photos.s3.amazonaws.com/launch%20job.jpg' alt='Choose a task' width={800} height={420} /> ### Step 2: Launch the Job Select your task and launch the job with the correct input columns. Jobs are immediately added to the queue based on your account tier and capacity. You will see progress and results once the task is running. Even extremely large jobs usually complete within 90 minutes. <Image src='https://web-public-photos.s3.amazonaws.com/job%20progress.jpg' alt='Choose a task' width={800} height={420} /> If you do not already have an AI task created, use the editor to create one or copy a template to edit from the template library. The [task creation tool](/account/custom/new) guides will help you get started. <Image src='https://web-public-photos.s3.amazonaws.com/launch%20job.jpg' alt='Choose a task' width={800} height={420} /> ### Step 3: View results Results are saved as a new dataset version for review. You can then search and filter data before exporting and using the results of the task. You can also run data on another task right from the datasets page. <Image src='https://web-public-photos.s3.amazonaws.com/datasetview.jpg' alt='Choose a task' width={800} height={420} /> For help setting up your first datasets, or questions about capacity and running large files, reach out the the team or book time with us to let us help you. [Book a Session](/demo) ## Integrations ### Hubspot To connect to hubspot you need a team subscription. The connection is shared across the team, so only one team member will need to connect. The Hubspot integration can be found under the Integration tab of the settings section. #### Connecting to Hubspot Promptloop uses a standard oauth connection with hubspot. From your logged in promptloop settings page select the orange **Connect to Hubspot** button. This will bring you to hubspot where you will give Promptloop permission to view and edit your Company values. These values are never edited without user selection. After accepting the connection you should be redirected to the Integrations page and should now see 'Connected' along with options to **Import List from Hubspot** and **Set up Autosync**. #### Importing a list from Hubspot Upon selection of this you can either choose an existing list to import, or select your entire list of companies. Once you select the source, you can choose which columns to import. We recommend only importing the needed columns, as otherwise the resulting dataset will be unnecessarily large. The unique hubspot identifier is always imported so that the records can be updated after you run a Promptloop task on them. After confirming which columns you want to import select **Import**. This can take anywhere from 1 - 5 minutes depending on the size of the list. If you reload the integrations page you will see your list import in progress. Once it says **Synced** you can click the link to go to the dataset. You can now run any task on this data. Be sure to not delete the hs_object_id column. #### Updating a dataset to Hubspot Whenever you are ready you can select **Sync to Hubspot** from the dataset page. You will be able to select which of the dataset columns you want to update into Hubspot and then select the corresponding Hubspot column. If you want to create a new column, you can toggle **new**. Then click Sync. This will update each record in the dataset, using the selected version. #### Disconnecting Hubspot To disconnect hubspot select the disconnect button and follow the prompts.

    Datasets - Running AI tasks on files

    Promptloop streamlines the process of applying AI models to your data. We do this with datasets. Upload a CSV or Excel file, select relevant inputs for the model to use, and launch AI tasks on hundreds to thousands of rows.

    Datasets are where you will store, edit, and view data that you are processing with AI tasks.

    Helpful Links

    Getting Started

    This video overview provides a guide of how to get started and take full advantage of datasets.

    Watch the video

    Files and Versions Datasets are where you can upload the data you are working with on the PromptLoop Platform.

    Uploading You can upload data to launch a job, using an existing AI task, or upload a dataset to filter and analyze first before running.

    Run an AI task on each row When you run an AI task on a dataset, the job will run in the background and save as a new version in the dataset when its complete. This will allow you to run tens of thousands of operations at once and freeing you up for other work.

    Limits

    If you have questions about limits, or have a task that requires a large amount of rows, reach out to the team here.

    Reliable and Scalable Datasets are an extremely powerful feature available to all users and one of the core capabilities of how we think about leveraging AI models efficiently.

    How to run an AI task on a spreadsheet file upload

    Step 1: Upload a file with data

    Datasets let you use any data table in an Excel or CSV file. Our systems automatically detect columns and let you select which you want to use.

    • You can select the columns that you will use as inputs for the task - this is often a single column like a website or search term
    • Results from your task - new columns or rows - will be added into the uploaded sheet and available to download as a new file
    Choose a task

    Step 2: Launch the Job

    Select your task and launch the job with the correct input columns. Jobs are immediately added to the queue based on your account tier and capacity. You will see progress and results once the task is running. Even extremely large jobs usually complete within 90 minutes.

    Choose a task

    If you do not already have an AI task created, use the editor to create one or copy a template to edit from the template library. The task creation tool guides will help you get started.

    Choose a task

    Step 3: View results

    Results are saved as a new dataset version for review. You can then search and filter data before exporting and using the results of the task. You can also run data on another task right from the datasets page.

    Choose a task

    For help setting up your first datasets, or questions about capacity and running large files, reach out the the team or book time with us to let us help you. Book a Session

    Integrations

    Hubspot

    To connect to hubspot you need a team subscription. The connection is shared across the team, so only one team member will need to connect.

    The Hubspot integration can be found under the Integration tab of the settings section.

    Connecting to Hubspot

    Promptloop uses a standard oauth connection with hubspot. From your logged in promptloop settings page select the orange Connect to Hubspot button. This will bring you to hubspot where you will give Promptloop permission to view and edit your Company values. These values are never edited without user selection.

    After accepting the connection you should be redirected to the Integrations page and should now see 'Connected' along with options to Import List from Hubspot and Set up Autosync.

    Importing a list from Hubspot

    Upon selection of this you can either choose an existing list to import, or select your entire list of companies. Once you select the source, you can choose which columns to import. We recommend only importing the needed columns, as otherwise the resulting dataset will be unnecessarily large. The unique hubspot identifier is always imported so that the records can be updated after you run a Promptloop task on them. After confirming which columns you want to import select Import. This can take anywhere from 1 - 5 minutes depending on the size of the list. If you reload the integrations page you will see your list import in progress. Once it says Synced you can click the link to go to the dataset. You can now run any task on this data. Be sure to not delete the hs_object_id column.

    Updating a dataset to Hubspot

    Whenever you are ready you can select Sync to Hubspot from the dataset page. You will be able to select which of the dataset columns you want to update into Hubspot and then select the corresponding Hubspot column. If you want to create a new column, you can toggle new. Then click Sync. This will update each record in the dataset, using the selected version.

    Disconnecting Hubspot

    To disconnect hubspot select the disconnect button and follow the prompts.