title | description | author | ms.author | ms.topic | ms.custom | ms.date | ms.search.form |
---|---|---|---|---|---|---|---|
How to access on-premises data sources in Data Factory |
This article describes how to configure a gateway to access on-premises data sources from Data Factory for Microsoft Fabric. |
lrtoyou1223 |
lle |
how-to |
01/24/2025 |
On-premises data sources gateway |
Data Factory for Microsoft Fabric is a powerful cloud-based data integration service that allows you to create, schedule, and manage workflows for various data sources. In scenarios where your data sources are located on-premises, Microsoft provides the On-Premises Data Gateway to securely bridge the gap between your on-premises environment and the cloud. This document guides you through the process of accessing on-premises data sources within Data Factory for Microsoft Fabric using the On-Premises Data Gateway.
-
An on-premises data gateway is a software application designed to be installed within a local network environment. It provides a means to directly install the gateway onto your local machine. For detailed instructions on how to download and install the on-premises data gateway, refer to Install an on-premises data gateway.
:::image type="content" source="media/how-to-access-on-premises-data/gateway-setup.png" alt-text="Screenshot showing the on-premises data gateway setup.":::
-
Sign-in using your user account to access the on-premises data gateway, after which it's prepared for utilization.
:::image type="content" source="media/how-to-access-on-premises-data/gateway-setup-after-sign-in.png" alt-text="Screenshot showing the on-premises data gateway setup after the user signed in.":::
Note
An on-premises data gateway of version higher than or equal to 3000.214.2 is required to support Fabric pipelines.
-
Navigate to the admin portal and select the settings button (an icon that looks like a gear) at the top right of the page. Then choose Manage connections and gateways from the dropdown menu that appears.
:::image type="content" source="media/how-to-access-on-premises-data/manage-connections-gateways.png" alt-text="Screenshot showing the Settings menu with Manage connections and gateways highlighted.":::
-
On the New connection dialog that appears, select On-premises and then provide your gateway cluster, along with the associated resource type and relevant information.
:::image type="content" source="media/how-to-access-on-premises-data/new-connection-details.png" alt-text="Screenshot showing the New connection dialog with On-premises selected.":::
Available connection types supported for on-premises connections include:
- Entra ID
- Adobe Analytics
- Analysis Services
- Azure Blob Storage
- Azure Data Lake Storage Gen2
- Azure Table Storage
- Essbase
- File
- Folder
- Google Analytics
- IBM DB2
- MySQL
- OData
- ODBC
- OLE DB
- Oracle
- PostgreSQL
- Salesforce
- SAP Business Warehouse Message Server
- SAP Business Warehouse Server
- SAP HANA
- SharePoint
- SQL Server
- Sybase
- Teradata
- Web
For a comprehensive list of the connectors supported for on-premises data types, refer to Data pipeline connectors in Microsoft Fabric.
-
Go to your workspace and create a Dataflow Gen2.
:::image type="content" source="media/how-to-access-on-premises-data/create-new-dataflow.png" alt-text="Screenshot showing a demo workspace with the new Dataflow Gen2 option highlighted.":::
-
Add a new source to the dataflow and select the connection established in the previous step.
:::image type="content" source="media/how-to-access-on-premises-data/connect-data-source.png" lightbox="media/how-to-access-on-premises-data/connect-data-source.png" alt-text="Screenshot showing the Connect to data source dialog in a Dataflow Gen2 with an on-premises source selected.":::
-
You can use the Dataflow Gen2 to perform any necessary data transformations based on your requirements.
:::image type="content" source="media/how-to-access-on-premises-data/transform-data-inline.png" lightbox="media/how-to-access-on-premises-data/transform-data.png" alt-text="Screenshot showing the Power Query editor with some transformations applied to the sample data source.":::
-
Use the Add data destination button on the Home tab of the Power Query editor to add a destination for your data from the on-premises source.
:::image type="content" source="media/how-to-access-on-premises-data/add-destination-inline.png" lightbox="media/how-to-access-on-premises-data/add-destination.png" alt-text="Screenshot showing the Power Query editor with the Add data destination button selected, showing the available destination types.":::
-
Publish the Dataflow Gen2.
:::image type="content" source="media/how-to-access-on-premises-data/publish-dataflow-inline.png" lightbox="media/how-to-access-on-premises-data/publish-dataflow.png" alt-text="Screenshot showing the Power Query editor with the Publish button highlighted.":::
Now you've created a Dataflow Gen2 to load data from an on-premises data source into a cloud destination.
-
Go to your workspace and create a data pipeline.
:::image type="content" source="media/how-to-access-on-premises-data/create-pipeline.png" alt-text="Screenshot showing how to create a new data pipeline.":::
Note
You need to configure the firewall to allow outbound connections *.frontend.clouddatahub.net from the gateway for Fabric pipeline capabilities.
-
From the Home tab of the pipeline editor, select Copy data and then Use copy assistant. Add a new source to the activity in the assistant's Choose data source page, then select the connection established in the previous step.
:::image type="content" source="media/how-to-access-on-premises-data/choose-data-source.png" lightbox="media/how-to-access-on-premises-data/choose-data-source.png" alt-text="Screenshot showing where to choose a new data source from the Copy data activity.":::
-
Select a destination for your data from the on-premises data source.
:::image type="content" source="media/how-to-access-on-premises-data/choose-destination.png" lightbox="media/how-to-access-on-premises-data/choose-destination.png" alt-text="Screenshot showing where to choose the data destination in the Copy activity.":::
-
Run the pipeline.
:::image type="content" source="media/how-to-access-on-premises-data/run-pipeline.png" lightbox="media/how-to-access-on-premises-data/run-pipeline.png" alt-text="Screenshot showing where to run the pipeline in the pipeline editor window.":::
Now you've created and ran a pipeline to load data from an on-premises data source into a cloud destination.
Note
Local access to the machine with the on-premises data gateway installed is not allowed in data pipelines.