Yarken Help Centre

Creating Pipelines

General prerequisites

These apply to all Cloud Storage Pipelines:

  • You must have the admin rights within the application to create and manage pipelines.

  • The data you plan to import must follow a supported format, such as CSV, GZ, or Parquet.

  • The data mapping templates should be define. For more information about the data mapping template, refer Data Mapping Templates.

  • Required Connections must be configured in advance for the selected cloud storage or API source. For more information on how to configure Cloud Storage connection, refer Creating a new Pipeline Connection | Create Cloud Storage Connection (manual only).

Create a Cloud Storage Pipeline

Cloud Storage Pipelines allow you to automatically fetch usage and billing files from cloud object storage and import them into the application. During setup, you select the storage provider, configure access, choose a data mapping template, define the schedule, and assign a pipeline name. Once created, the pipeline runs automatically based on the configured frequency.

Azure Blob Storage

Prerequisites

Before creating an Azure Cloud Storage Pipeline, ensure you have:

  • A valid Azure Storage account with Blob Storage enabled.

  • An Azure connection configured in YäRKEN using:

    • Storage account name

    • Access key

  • Access to the container and folder path where usage files are stored.

  • Azure usage files available in Blob Storage, such as:

    • Azure Usage Details or

    • Azure Cost Management Export

  • A data mapping template to define how Azure file fields map to YäRKEN fields.

  • File location details, including container name and file path or naming convention.

Procedure

To create,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, choose the storage type as Azure Blob and click NEXT.

Azure blob-20260102-092224.png
  1. Select an existing configured Connection or create a new. For more information on how to configure a connection prior to creating the pipeline, refer Creating a new Pipeline Connection | Create Cloud Storage Connection (manual only).

    Screenshot 2026-01-02 at 3.00.54 PM-20260102-093934.png
    1. (Optional) To create a new connection, click + ADD NEW and enter the below details on Create a new connection dialog box:

      1. Name: The name of the connection.

      2. Account Name: The Azure Storage Account Name for accessing the storage container.

      3. Access Key: The Azure Storage Account Access Key for accessing the storage account data.

      4. Container Name: The name of the container within the Azure Storage Account where the exported files are stored, for example, billing.

      5. Click SAVE.

image-20260102-100010.png
  1. Enter a Directory Name from which you want to retrieve files and select NEXT.

Screenshot 2026-01-02 at 3.32.49 PM-20260102-100321.png
  1. Select a Data Mapping Template from the dropdown list and click NEXT. For more information on Data Mapping Template, refer Data Mapping Templates.

Screenshot 2026-01-02 at 3.47.38 PM-20260102-101803.png
  1. On Schedule tab, select Frequency for the pipeline to run from the dropdown list and then click NEXT:

    1. Hourly: Select the hour(s) between 0 to 12 for the pipeline to start run in the field labelled. The run will occur at your defined intervals.

    2. Daily: Occurs every day, starting at 12:00 AM.

    3. Monthly: Specify the day of the month and select the month interval.

Screenshot 2026-01-02 at 3.43.39 PM-20260102-101858.png
  1. Enter the Pipeline Name and click SAVE.

The new pipeline is created in the application and remains deactivated by default. You must activate the pipeline to start scheduled or on-demand runs.

AWS S3

Prerequisites

Before creating an AWS Cloud Storage Pipeline, ensure you have:

  • A valid AWS S3 account with access to the required bucket.

  • An AWS connection configured in YäRKEN using:

    • Access key

    • Secret key

  • Access to the S3 bucket and folder path where usage files are stored.

  • AWS usage files exported to S3, such as:

    • AWS Cost and Usage Report (CUR)

  • A data mapping template to map fields from AWS usage files to YäRKEN fields.

  • File location details, including folder path and/or file naming patterns.

Procedure

To create,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, choose the storage type as AWS S3 and click NEXT.

Screenshot 2026-01-02 at 4.05.39 PM-20260102-103559.png
  1. Select an existing configured Connection or create a new. For more information on how to configure a connection prior to creating the pipeline, refer Creating a new Pipeline Connection | Create Cloud Storage Connection (manual only).

    image-20260102-103833.png
    1. (Optional) To create a new connection, click + ADD NEW and enter the below details on Create a new connection dialog box:

      1. Name: The name of the connection.

      2. Access Key: The Azure Storage Account Access Key for accessing the storage account data.

      3. Secret Key:  The AWS Secret Key used along with the Access Key to access the S3 storage bucket.

      4. Bucket Name: The name of the S3 bucket where the exported files are being stored, for example, yarken-billing.

      5. Click SAVE.

image-20260102-111653.png
  1. Enter a Directory Name from which you want to retrieve files and select NEXT.

image-20260102-111932.png
  1. Select a Data Mapping Template from the dropdown list and click NEXT. For more information on Data Mapping Template, refer Data Mapping Templates.

Screenshot 2026-01-02 at 3.47.38 PM-20260102-101803.png
  1. On Schedule tab, select Frequency for the pipeline to run from the dropdown list and then click NEXT:

    1. Hourly: Select the hour(s) between 0 to 12 for the pipeline to start run in the field labelled. The run will occur at your defined intervals.

    2. Daily: Occurs every day, starting at 12:00 AM.

    3. Monthly: Specify the day of the month and select the month interval.

Screenshot 2026-01-02 at 3.43.39 PM-20260102-101858.png
  1. Enter the Pipeline Name and click SAVE.

The new pipeline is created in the application and remains deactivated by default. You must activate the pipeline to start scheduled or on-demand runs.

Google Cloud Storage

Prerequisites

Before creating a Google Cloud Storage Pipeline, ensure you have:

  • A valid Google Cloud Storage (GCS) bucket.

  • A Google Cloud Storage connection configured in YäRKEN using:

    • Access key

    • Secret key

  • Access to the GCS bucket and folder path containing usage files.

  • GCP usage files exported to GCS, such as:

    • GCP Billing Export files

  • A data mapping template to map GCP file fields to YäRKEN fields.

  • File location details, including bucket path and file naming patterns.

Procedure

To create,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, choose the storage type as Google Cloud Storage and click NEXT.

Screenshot 2026-01-02 at 4.54.25 PM-20260102-112453.png
  1. Select an existing configured Connection or create a new. For more information on how to configure a connection prior to creating the pipeline, refer Creating a new Pipeline Connection | Create Cloud Storage Connection (manual only).

    image-20260102-112609.png
    1. (Optional) To create a new connection, click + ADD NEW and enter the below details on Create a new connection dialog box:

      1. Name: The name of the connection.

      2. Access Key: The access key used to authenticate and authorize the access to the storage service.

      3. Secret Key: A secret key associated with the access key used to authenticate and authorize secure access to the storage service.

      4. Bucket Name: The name of the GCS bucket where the exported files (objects) are being stored, for example, yarken-billing.

      5. Click SAVE.

Screenshot 2026-01-02 at 4.58.51 PM-20260102-112929.png
  1. Enter a Directory Name from which you want to retrieve files and select NEXT.

image-20260102-113029.png
  1. Select a Data Mapping Template from the dropdown list and click NEXT. For more information on Data Mapping Template, refer Data Mapping Templates.

Screenshot 2026-01-02 at 3.47.38 PM-20260102-101803.png
  1. On Schedule tab, select Frequency for the pipeline to run from the dropdown list and then click NEXT:

    1. Hourly: Select the hour(s) between 0 to 12 for the pipeline to start run in the field labelled. The run will occur at your defined intervals.

    2. Daily: Occurs every day, starting at 12:00 AM.

    3. Monthly: Specify the day of the month and select the month interval.

Screenshot 2026-01-02 at 3.43.39 PM-20260102-101858.png
  1. Enter the Pipeline Name and click SAVE.

The new pipeline is created in the application and remains deactivated by default. You must activate the pipeline to start scheduled or on-demand runs.

Create an API Pipeline manually

Note: Manual API pipeline creation is not recommended because pipelines are automatically created via the Consent URL. These steps should be used only when manual setup is required.

As part of the consent-link process, when a tenant accepts the consent links, the application automatically creates the API connections and their corresponding API pipelines. Use the below section to create an API Pipelines manually.

Microsoft Graph API

The Microsoft Graph API pipeline is used to collect:

  • License details

  • Product usage details

  • Entra ID activity details

  • Teams activity details

  • Defender enrolled device details

Prerequisite

  • A Microsoft Graph API connection must already be configured in the application.

Procedure

To configure,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, select Microsoft Graph API and click NEXT.

Screenshot 2026-01-03 at 2.46.27 PM-20260103-091647.png
  1. Under Select a connection, click the dropdown icon and select already configured API connection for Microsoft Graph API.

image-20260103-092113.png
  1. Select the API data to sync and click NEXT:

    1. License Details

    2. Product Usage Details

    3. Entra ID Activity Details

    4. Teams Activity Details

    5. Defender Enrolled Details

image-20260103-093157.png
  1. On the Schedule tab, choose Monthly frequency and select Day 7 of every 1 month (recommended).

image-20260103-092957.png
  1. Click NEXT.

  2. On the Details tab, enter a Pipeline Name and click SAVE.

image-20260103-093512.png
  1. The new pipeline is created in a deactivated state. Activate it using the Activate toggle and click OK on confirmation popup.

image-20260103-094412.png
  1. Once activated, the pipeline runs on the scheduled timeframe. You may also use Run Once to trigger an immediate execution.

Screenshot 2026-01-03 at 3.16.49 PM-20260103-094752.png

The YäRKEN application immediately starts fetching data. You can click View Run History icon to check the last running status.

Screenshot 2026-01-03 at 3.23.22 PM-20260103-095444.png

Power BI API

For Power BI, two separate pipelines must be created using the same Power BI connection:

  • One for daily usage details

  • One for monthly workspace details.

Prerequisite

  • A Power BI API connection must already be configured.

Procedure

Pipeline 1: Power BI Usage Details (Daily)

To configure,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, select Power BI API and click NEXT.

Screenshot 2026-01-03 at 3.34.53 PM-20260103-100509.png


  1. Under Select a connection, click the dropdown icon and select already configured API connection for Power BI API.

Screenshot 2026-01-03 at 3.37.37 PM-20260103-100755.png
  1. Select Power BI Usage Details to sync and click NEXT.

Screenshot 2026-01-03 at 3.39.40 PM-20260103-101002.png
  1. On the Schedule tab, select Daily frequency.

image-20260103-101250.png
  1. Click NEXT.

  2. On the Details tab, enter a Pipeline Name and click SAVE.

image-20260103-093512.png
  1. The new pipeline is created in a deactivated state. Activate it using the Activate toggle and click OK on confirmation popup.

Screenshot 2026-01-03 at 3.46.19 PM-20260103-101730.png
  1. Once activated, the pipeline runs on the scheduled timeframe. You may also use Run Once to trigger an immediate execution.

Screenshot 2026-01-03 at 3.48.31 PM-20260103-101908.png

The YäRKEN application immediately starts fetching data. You can click View Run History icon to check the last running status.

Screenshot 2026-01-03 at 3.50.46 PM-20260103-102121.png

Pipeline 2: Power BI Workspace Details (Monthly)

Procedure

To configure,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, select Power BI API and click NEXT.

Screenshot 2026-01-03 at 3.34.53 PM-20260103-100509.png


  1. Under Select a connection, click the dropdown icon and select already configured API connection for Power BI API.

Screenshot 2026-01-03 at 3.37.37 PM-20260103-100755.png
  1. Select Power BI Workspace Details to sync and click NEXT.

Screenshot 2026-01-03 at 3.53.49 PM-20260103-102406.png


  1. On the Schedule tab, select Monthly frequency and choose Day 7 of every 1 month.

image-20260103-092957.png
  1. Click NEXT.

  2. On the Details tab, enter a Pipeline Name and click SAVE.

image-20260103-093512.png
  1. The new pipeline is created in a deactivated state. Activate it using the Activate toggle and click OK on confirmation popup.

Screenshot 2026-01-03 at 3.46.19 PM-20260103-101730.png
  1. Once activated, the pipeline runs on the scheduled timeframe. You may also use Run Once to trigger an immediate execution.

Screenshot 2026-01-03 at 3.48.31 PM-20260103-101908.png

The YäRKEN application immediately starts fetching data. You can click View Run History icon to check the last running status.

Screenshot 2026-01-03 at 3.50.46 PM-20260103-102121.png

Google Cloud API

The Google Cloud API pipeline is used to collect:

  • Sign-in details

  • Product usage details

Prerequisite

  • A Google API connection must be configured before creating the pipeline.

Procedure

To configure,

  1. Navigate to Admin > Pipelines > Pipelines.

  2. On the Pipelines page, click + NEW PIPELINE.

  3. On the Create New Pipeline dialog box, select Google Cloud API and click NEXT.

Screenshot 2026-01-03 at 4.03.00 PM-20260103-103317.png
  1. Under Select a connection, click the dropdown icon and select already configured API connection for Google Cloud API.

image-20260103-103634.png


  1. Select the API data to sync and click NEXT:

    1. Sign In Details

    2. Product Usage Details

image-20260103-103749.png


  1. On the Schedule tab, select Monthly frequency and choose Day 7 of every 1 month (recommended).

image-20260103-092957.png
  1. Click NEXT.

  2. On the Details tab, enter a Pipeline Name and click SAVE.

image-20260103-093512.png
  1. The new pipeline is created in a deactivated state. Activate it using the Activate toggle and click OK on confirmation popup.

Screenshot 2026-01-03 at 4.13.40 PM-20260103-104426.png
  1. Once activated, the pipeline runs on the scheduled timeframe. You may also use Run Once to trigger an immediate execution.

Screenshot 2026-01-03 at 4.15.16 PM-20260103-104549.png


The YäRKEN application immediately starts fetching data. You can click View Run History icon to check the last running status.

image-20260103-104837.png

Related content

Pipelines

Managing Pipelines

Creating a new Pipeline Connection | Create API Connection using Consent URL

Creating a new Pipeline Connection | Create API Connection manually

Creating a new Pipeline Connection | Create Cloud Storage Connection (manual only)