Activity
Mon
Wed
Fri
Sun
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
What is this?
Less
More

Memberships

Learn Microsoft Fabric

Public • 5.7k • Free

Fabric Dojo 织物

Private • 208 • $39/m

104 contributions to Learn Microsoft Fabric
DP-600 Passed !
I passed DP-600 today... A huge thanks to this community and to you @Will Needham ! Your support and guidance have been invaluable... Thank you all for being such a fantastic source of knowledge and encouragement !
13
13
New comment Oct 23
1 like • Oct 14
Congratulations Stephane 🥳
Data Skew in Spark notebook
I’ve been encountering data skew issues in my Spark notebook. I’ve tried implementing salting and repartitioning with different columns, but regardless of the combinations or the number of partitions (up to 200), the result is still skewed data. I’m running out of ideas on how to resolve this. Has anyone else faced similar issues or have any suggestions?
1
4
New comment 3d ago
1 like • Aug 24
Hi Stephane, I see you have already tried different salting and repartitioning techniques, ideally that should fix the skewness. I think maybe you have find right columns. Let's connect over DM
Future of Lakehouse vs Warehouse
Hey All, looking into in this release plan, it sparked a question in me if you see the highlighted features related to lakehouse and warehouse in the attached screenshot. Warehouse: To get Spark support. Lakehouse: To get schema support and additional security features like OLS. I mean, till now all the discussions I have seen, Lakehouse was chosen for Spark and Warehouse for Security. Now if both gets both, would be an interesting discussion to start on how the future of lakehouse vs warehouse do you see? When would you pick what, or will they even get merged (too far I know😂) ?
10
6
New comment Jul 16
Future of Lakehouse vs Warehouse
0 likes • Jul 16
@Mats Ka right, but I think based on how lakehouse is build from backend, it doesn't work or behave like traditional databases. So it may lack those features and functionalities which data warehouse would have for example functions, stored procedures, system tables etc.
1 like • Jul 16
@Mats Ka you're right. It comes down to how the project architecture is setup. Few reasons to choose DWH apart from the obvious one: 1. Security and more granular access controls. Like: RLS, OLS. This is one of the reason why DWH is suggested for Gold layer, if using medallion architecture. 2. Additional features like functions and stored proc. For ETL projects with metadata driven data pipeline, DWH is used as storage for metadata tables as they can be effectively interacted from the pipelines using the stored proc activities. There can be more reasons, these two came on top of my mind
Casestudies questions time management
Can anybody tell me how many casestudies will be there in exam? And approximately how many questions on case study?
0
8
New comment Jun 14
2 likes • Jun 13
I got 1 case study with 9 questions. I felt less on time. Would recommend you finishing the first section with atleast 20mins in hand for the case study.
Databricks open sources Unity Catalog - coming to Fabric soon? 👀
Did you see the news today that Databricks announced they are open-sourcing Unity Catalog - pretty interesting! Will be interesting to see if Microsoft do anything about that... What are you your thoughts? There is a quote from Jessica Hawk (CVP, Data, AI, Digital Applications, Microsoft) in the article: "Microsoft is committed to the open-source community and empowering customers with choice. Databricks has been a strategic partner for years and it's great to see them open-sourcing Unity Catalog. We believe truly open standards with broad industry participation are in customers' best interests. Our collaboration with Databricks continues to elevate Microsoft Azure as the best choice for data and AI workloads,"
22
9
New comment Jun 14
0 likes • Jun 13
Wow! That's interesting, how companies now start making use of it
2 likes • Jun 13
@Nam Le yeah, another fight for Apache Icheberg, which Databricks won by buying out Tabular😂. I see big plans towards one common open source format for lakehouses (now they already Delta lake and Iceberg), they might merge them to create one unified version which becomes industry standard for all to use.
1-10 of 104
Vinayak K
5
235points to level up
@vinayak-k-8946
Data Engineer | Aspiring Solution Architect & MS Fabric Expert

Active 2d ago
Joined Mar 15, 2024
powered by