Supported File Formats
When writes data to Microsoft OneLake, you can choose the file format for the exported data. The following file formats are supported for the Microsoft OneLake destination:- CSV—Plain text comma-separated values.
- Avro—A row-based binary format that supports schema evolution.
- (Default) Parquet—A columnar storage format that is optimized for analytics.
Authenticate to Microsoft OneLake
After you add the connector, you need to set the required properties.- File Format: Select the file format that you want to use: CSV , Avro, or Parquet (default).
- URI: Enter the path of the file system and folder that contains your files (for example, onelake://WorkspaceIdentifier/DatabaseIdentifier/Files/CustomFolder).
- Azure Active Directory (default)
- Azure Managed Service Identity
- Azure Service Principal
- Azure Service Principal Certificate
Azure Active Directory
To connect with an Azure Active Directory (AD) user account, specify the following properties:- Auth Scheme: Select AzureAD.
- Use Lake Formation: Select whether you want the AWS Lake Formation service to retrieve temporary credentials. These temporary credentials enforce access policies against the user based on the configured IAM role. You can use this service when you authenticate through AzureAD, Okta, ADFS, and PingFederate, while providing a Security Assertion Markup Language (SAML) assertion. By default, the Enable checkbox is not selected.
Azure Managed Service Identity
Azure Service Principal
Azure Service Principal Certificate
Complete Your Connection
To complete your connection:-
Specify the following properties:
For the CSV file format:
- FMT: Enter the format that you want to use to parse all text files. The default format is CsvDelimited.
- Aggregate Files: Select whether you want to aggregate all the files that are located in the URI directory and that have the same schema into a single table named AggregatedFiles. By default, the Enable checkbox is not selected.
- Include Column Headers: Select whether you want to obtain column headers from the first lines of the specified files. By default, the Enable checkbox is already selected.
- Data Model: Select the data model that you want to use to parse documents for your format and to generate the database metadata. The default data model is Document.
- Aggregate Files: Select whether you want to aggregate all the files that are located in the URI directory and that have the same schema into a single table named AggregatedFiles. By default, the Enable checkbox is not selected.
- Define advanced connection settings on the Advanced tab. (In most cases, though, you should not need these settings.)
- If you authenticate with AzureAD, click Connect to Microsoft OneLake to connect to your Microsoft OneLake account.
- Click Create & Test to create your connection.