Privacy, security, and responsible use of Copilot for Data Factory (preview)
With Copilot for Data Factory in Microsoft Fabric and other generative AI features in preview, Microsoft Fabric brings a new way to transform and analyze data, generate insights, and create visualizations and reports in Data Science and the other workloads.
Before your business can start using Copilot capabilities in Fabric, your administrator needs to enable Copilot in Microsoft Fabric. You may have questions about how it works, how it keeps your business data secure and adheres to privacy requirements, and how to use generative AI responsibly.
The article Privacy, security, and responsible use for Copilot (preview) provides an overview of Copilot in Fabric. Read on for details about Copilot for Data Factory.
Note
- Your administrator needs to enable the tenant switch before you start using Copilot. See the article Copilot tenant settings for details.
- Your F64 or P1 capacity needs to be in one of the regions listed in this article, Fabric region availability.
- If your tenant or capacity is outside the US or France, Copilot is disabled by default unless your Fabric tenant admin enables the Data sent to Azure OpenAI can be processed outside your tenant's geographic region, compliance boundary, or national cloud instance tenant setting in the Fabric Admin portal.
- Copilot in Microsoft Fabric isn't supported on trial SKUs. Only paid SKUs (F64 or higher, or P1 or higher) are supported.
- Copilot in Fabric is currently rolling out in public preview and is expected to be available for all customers by end of March 2024.
- See the article Overview of Copilot in Fabric and Power BI for more information.
Capabilities and intended uses of Copilot for Data Factory
The Copilot features in Data Factory currently support use in Dataflow Gen2. These features include the Copilot chat pane and suggested transformations.
Copilot has the following intended uses:
- Provide a summary of the query and the applied steps.
- Generate new transformation steps for an existing query.
- Generate a new query that may include sample data or a connection to a data source that requires configuring authentication.
Limitations of Copilot for Data Factory
Here are the current limitations of Copilot for Data Factory:
- Copilot can't perform transformations or explanations across multiple queries in a single input. For instance, you can't ask Copilot to "Capitalize all the column headers for each query in my dataflow."
- Copilot doesn't understand previous inputs and can't undo changes after a user commits a change when authoring, either via user interface or the chat pane. For example, you can't ask Copilot to "Undo my last 5 inputs." However, users can still use the existing user interface options to delete unwanted steps or queries.
- Copilot can't make layout changes to queries in your session. For example, if you tell Copilot to create a new group for queries in the editor, it doesn't work.
- Copilot may produce inaccurate results when the intent is to evaluate data that isn't present within the sampled results imported into the sessions data preview.
- Copilot doesn't produce a message for the skills that it doesn't support. For example, if you ask Copilot to "Perform statistical analysis and write a summary over the contents of this query", it doesn't complete the instruction successfully as mentioned previously. Unfortunately, it doesn't give an error message either.
Data use of Copilot for Data Factory
- Copilot can only access data that is accessible to the user's current Gen2 dataflow session, and that is configured and imported into the data preview grid. Learn more about getting data in Power Query.
Evaluation of Copilot for Data Factory
- The product team has tested Copilot to see how well the system performs within the context of Gen2 dataflows, and whether AI responses are insightful and useful.
- The team also invested in other harms mitigations, including technological approaches to focusing Copilot's output on topics related to data integration.
Tips for working with Copilot for Data Factory
- Copilot is best equipped to handle data integration topics, so it's best to limit your questions to this area.
- If you include descriptions such as query names, column names, and values in the input, Copilot is more likely to generate useful outputs.
- Try breaking complex inputs into more granular tasks. This helps Copilot better understand your requirements and generate a more accurate output.
Notes by release
Additional information for future releases or feature updates will appear here.
Related content
Feedback
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for