Blog | luminousmen
Subscribe
Sign in
Home
Notes
Spark Under the Hood
Archive
About
Latest
Top
Discussions
Anatomy of Apache Spark Application
Apache Spark job autopsy
Aug 12
•
luminousmen
7
Share this post
Blog | luminousmen
Anatomy of Apache Spark Application
Copy link
Facebook
Email
Notes
More
July 2025
Choosing the Right Compression Codec
A Guide for People Who Move Data
Jul 29
•
luminousmen
3
Share this post
Blog | luminousmen
Choosing the Right Compression Codec
Copy link
Facebook
Email
Notes
More
Apache Spark Core Concepts Explained
Let's deep dive into Apache Spark core abstractions
Jul 22
•
luminousmen
10
Share this post
Blog | luminousmen
Apache Spark Core Concepts Explained
Copy link
Facebook
Email
Notes
More
Cluster Managers for Apache Spark: from YARN to Kubernetes
Deep dive into machinery that orchestrates Spark
Jul 8
•
luminousmen
6
Share this post
Blog | luminousmen
Cluster Managers for Apache Spark: from YARN to Kubernetes
Copy link
Facebook
Email
Notes
More
June 2025
Words Are the New Bytecode
You're not a full-stack developer anymore. You're a full-stack thinker.
Jun 24
•
luminousmen
1
Share this post
Blog | luminousmen
Words Are the New Bytecode
Copy link
Facebook
Email
Notes
More
Only the Squeaky Wheel Gets the Oil
In an ideal world, your work speaks for itself. But we don't live in a fairy tale.
Jun 17
•
luminousmen
6
Share this post
Blog | luminousmen
Only the Squeaky Wheel Gets the Oil
Copy link
Facebook
Email
Notes
More
Data Partitioning: Partition. Regret. Repeat
Partition early. Partition wisely.
Jun 10
•
luminousmen
3
Share this post
Blog | luminousmen
Data Partitioning: Partition. Regret. Repeat
Copy link
Facebook
Email
Notes
More
Why Parquet Is the Go-To Format for Data Engineers
With more practical lessons to help you with the data engineering journey
Jun 3
•
luminousmen
and
Vu Trinh
43
Share this post
Blog | luminousmen
Why Parquet Is the Go-To Format for Data Engineers
Copy link
Facebook
Email
Notes
More
7
May 2025
Pip Constraints Files
Don't just freeze your environment. Engineer it.
May 27
•
luminousmen
1
Share this post
Blog | luminousmen
Pip Constraints Files
Copy link
Facebook
Email
Notes
More
Data Partitioning: Slice Smart, Sleep Better
Partitioning is an architecture, not an optimization
May 13
•
luminousmen
2
Share this post
Blog | luminousmen
Data Partitioning: Slice Smart, Sleep Better
Copy link
Facebook
Email
Notes
More
April 2025
Understanding AWS Regions and Availability Zones: A Guide for Beginners
High Availability in the cloud: why us-east-1 alone is not a strategy (it's a gamble)
Apr 29
•
luminousmen
2
Share this post
Blog | luminousmen
Understanding AWS Regions and Availability Zones: A Guide for Beginners
Copy link
Facebook
Email
Notes
More
Change Data Capture (CDC)
Change Data Capture (CDC): what is it and why it's important?
Apr 22
•
luminousmen
9
Share this post
Blog | luminousmen
Change Data Capture (CDC)
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts