Ingest data by using a data pipeline, dataflow, or notebook
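A minimal notebook-based ingestion sketch, assuming a hypothetical landing folder and table name; a data pipeline Copy activity or a Dataflow Gen2 would achieve the same result without code.

```python
# Minimal ingestion sketch for a Fabric notebook (PySpark).
# The source path and table name below are placeholders, not real endpoints.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Read raw CSV files landed in the lakehouse Files area (hypothetical path).
raw_df = (
    spark.read
    .option("header", "true")
    .option("inferSchema", "true")
    .csv("Files/landing/sales/*.csv")
)

# Persist to a managed Delta table in the lakehouse Tables area.
raw_df.write.mode("append").format("delta").saveAsTable("bronze_sales")
```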
Create and manage shortcuts
Implement file partitioning for analytics workloads in a lakehouse
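A sketch of partitioned writes with PySpark, assuming illustrative table and column names; partitioning on a low-cardinality column keeps file counts manageable and enables partition pruning.

```python
# Write a Delta table partitioned by a low-cardinality, date-derived column.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

sales = spark.table("bronze_sales").withColumn("order_year", F.year("order_date"))

(
    sales.write
    .format("delta")
    .mode("overwrite")
    .partitionBy("order_year")          # folder-per-year layout on OneLake
    .saveAsTable("silver_sales_partitioned")
)
```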
Create views, functions, and stored procedures
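A sketch of the view case only, using Spark SQL from a notebook over an assumed table; T-SQL functions and stored procedures are created against the warehouse or SQL analytics endpoint instead and are not shown here.

```python
# Create a Spark SQL view over a lakehouse table from a notebook.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

spark.sql("""
    CREATE OR REPLACE VIEW vw_sales_by_year AS
    SELECT order_year, SUM(amount) AS total_amount
    FROM silver_sales_partitioned
    GROUP BY order_year
""")
```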
Enrich data by adding new columns or tables
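A sketch of column-level enrichment with PySpark; the derived columns and source table are assumptions for illustration.

```python
# Enrich a table with derived columns.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

sales = spark.table("silver_sales_partitioned")

enriched = (
    sales
    .withColumn("net_amount", F.col("amount") - F.col("discount"))
    .withColumn("is_high_value", F.col("amount") > 1000)
    .withColumn("load_ts", F.current_timestamp())
)

enriched.write.mode("overwrite").format("delta").saveAsTable("silver_sales_enriched")
```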
Choose an appropriate method for copying data from a Fabric data source to a lakehouse or warehouse
Copy data by using a data pipeline, dataflow, or notebook
Add stored procedures, notebooks, and dataflows to a data pipeline
Schedule data pipelines, dataflows, and notebooks
Implement a data cleansing process
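A sketch of a simple cleansing pass, assuming hypothetical column names: trim and standardize strings and null out implausible values.

```python
# Basic cleansing: trim strings, standardize casing, coerce bad values to NULL.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

df = spark.table("bronze_customers")

cleansed = (
    df
    .withColumn("email", F.lower(F.trim("email")))
    .withColumn("country", F.upper(F.trim("country")))
    .withColumn("age", F.when(F.col("age").between(0, 120), F.col("age")))  # else NULL
)

cleansed.write.mode("overwrite").format("delta").saveAsTable("silver_customers")
```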
Implement a star schema for a lakehouse or warehouse, including Type 1 and Type 2 slowly changing dimensions
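A hedged sketch of Type 2 handling with the Delta Lake merge API; the dimension, staging table, and tracking columns (`is_current`, `valid_from`, `valid_to`) are assumptions, and a Type 1 dimension would simply overwrite attributes in a single merge.

```python
# Type 2 SCD sketch: expire changed rows, then append new versions.
from delta.tables import DeltaTable
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

updates = spark.table("staging_customers")          # latest source snapshot
dim = DeltaTable.forName(spark, "dim_customer")     # existing dimension

# Step 1: expire current rows whose tracked attributes changed.
(
    dim.alias("t")
    .merge(updates.alias("s"), "t.customer_id = s.customer_id AND t.is_current = true")
    .whenMatchedUpdate(
        condition="t.city <> s.city",
        set={"is_current": "false", "valid_to": "current_timestamp()"},
    )
    .execute()
)

# Step 2: insert new versions for changed keys and rows for brand-new keys.
current_keys = spark.table("dim_customer").where("is_current = true").select("customer_id")
new_versions = (
    updates.join(current_keys, "customer_id", "left_anti")   # keys with no current row
    .withColumn("is_current", F.lit(True))
    .withColumn("valid_from", F.current_timestamp())
    .withColumn("valid_to", F.lit(None).cast("timestamp"))
)
new_versions.write.mode("append").format("delta").saveAsTable("dim_customer")
```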
Implement bridge tables for a lakehouse or a warehouse
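A sketch of building a bridge table for a many-to-many relationship, assuming an order table with a delimited list of promotion codes; the names are illustrative.

```python
# Build a bridge table by exploding a multi-valued column into key pairs.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

orders = spark.table("silver_sales_enriched")

bridge_order_promotion = (
    orders
    .select("order_id", F.explode(F.split("promo_codes", ",")).alias("promo_code"))
    .withColumn("promo_code", F.trim("promo_code"))
    .dropDuplicates(["order_id", "promo_code"])
)

bridge_order_promotion.write.mode("overwrite").format("delta").saveAsTable("bridge_order_promotion")
```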
Denormalize data
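A sketch of denormalization by joining dimension attributes into a fact table to produce one wide table; the table names are placeholders.

```python
# Denormalize: fold dimension attributes into the fact grain.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

fact = spark.table("fact_sales")
dim_product = spark.table("dim_product").select("product_key", "product_name", "category")
dim_date = spark.table("dim_date").select("date_key", "calendar_year", "month_name")

wide = (
    fact
    .join(dim_product, "product_key", "left")
    .join(dim_date, "date_key", "left")
)

wide.write.mode("overwrite").format("delta").saveAsTable("sales_denormalized")
```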
Aggregate or de-aggregate data
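A sketch of both directions with assumed column names: aggregate to a summary grain with `groupBy`, and de-aggregate an array column back to detail rows with `explode`.

```python
# Aggregate and de-aggregate examples.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

sales = spark.table("silver_sales_enriched")

# Aggregate: one row per customer per year.
summary = (
    sales.groupBy("customer_id", "order_year")
    .agg(F.sum("net_amount").alias("total_net"), F.count("*").alias("order_count"))
)

# De-aggregate: expand an array of line items into one row per item.
line_items = sales.select("order_id", F.explode("items").alias("item"))
```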
Merge or join data
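A sketch distinguishing the two operations, with placeholder table names: merging compatible extracts with `unionByName`, then joining the result to a reference table on a shared key.

```python
# Merge (union) two channel extracts, then join to a reference table.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

online = spark.table("bronze_sales_online")
instore = spark.table("bronze_sales_instore")

# Merge rows from both channels; allowMissingColumns fills gaps with nulls.
all_sales = online.unionByName(instore, allowMissingColumns=True)

# Join to a reference table on the shared business key.
customers = spark.table("silver_customers")
all_sales_with_customer = all_sales.join(customers, "customer_id", "inner")
```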
Identify and resolve duplicate data, missing data, or null values
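A sketch that first surfaces duplicates and null counts, then resolves them; the key and column names are assumptions.

```python
# Identify and resolve duplicates and nulls.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

df = spark.table("silver_customers")

# Identify: duplicate business keys and null counts per column.
dupes = df.groupBy("customer_id").count().where("count > 1")
null_counts = df.select([F.count(F.when(F.col(c).isNull(), c)).alias(c) for c in df.columns])

# Resolve: keep one row per key, fill defaultable nulls, drop rows missing required fields.
resolved = (
    df.dropDuplicates(["customer_id"])
    .fillna({"country": "UNKNOWN"})
    .dropna(subset=["email"])
)
```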
Convert data types by using SQL or PySpark
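A PySpark casting sketch with illustrative columns; the same conversions can be written in SQL with `CAST` (or `TRY_CAST` to tolerate bad values).

```python
# Cast columns to the intended types.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

df = spark.table("bronze_sales")

typed = (
    df
    .withColumn("amount", F.col("amount").cast("decimal(18,2)"))
    .withColumn("order_date", F.to_date("order_date", "yyyy-MM-dd"))
    .withColumn("quantity", F.col("quantity").cast("int"))
)
```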
Filter data
[Spark DataFrame Where Filter | Multiple Conditions](https://sparkbyexamples.com/spark/spark-dataframe-where-filter/)
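A short filtering sketch along the lines of the linked article, using assumed column names: column expressions with `where` and an equivalent SQL-style predicate string with `filter`.

```python
# Filter rows with where/filter, including multiple conditions.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

sales = spark.table("silver_sales_enriched")

recent_high_value = sales.where(
    (F.col("order_year") >= 2024) & (F.col("net_amount") > 500)
)

# Equivalent SQL-style predicate string.
uk_orders = sales.filter("country = 'UK' AND is_high_value = true")
```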
Identify and resolve data loading performance bottlenecks in dataflows, notebooks, and SQL queries
Implement performance improvements in dataflows, notebooks, and SQL queries
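A sketch covering both of the preceding items on the notebook side, with placeholder table names: inspect the physical plan to spot expensive shuffles or scans, then broadcast the small side of a join and cache a reused DataFrame.

```python
# Identify bottlenecks, then apply common Spark tuning steps.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

fact = spark.table("fact_sales")
dim_product = spark.table("dim_product")

# Identify: review the physical plan (look for large shuffles and full scans).
fact.join(dim_product, "product_key").explain(mode="formatted")

# Improve: broadcast the small side to avoid a shuffle join, and cache reuse.
joined = fact.join(F.broadcast(dim_product), "product_key")
joined.cache()
joined.count()   # materialize the cache before downstream reuse
```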
Identify and resolve issues with Delta table file sizes
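A sketch for diagnosing and fixing small-file problems on a Delta table; the table name is a placeholder and the VACUUM retention shown is the default 7 days (168 hours).

```python
# Inspect file counts, compact small files, and clean up unreferenced files.
from delta.tables import DeltaTable
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Identify: DESCRIBE DETAIL reports numFiles and sizeInBytes for the table.
spark.sql("DESCRIBE DETAIL silver_sales_partitioned").select("numFiles", "sizeInBytes").show()

# Resolve: compact small files with OPTIMIZE, then vacuum old files.
DeltaTable.forName(spark, "silver_sales_partitioned").optimize().executeCompaction()
spark.sql("VACUUM silver_sales_partitioned RETAIN 168 HOURS")
```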