Junaid Effendi | Sharing knowledge for Engineers

Introduction to Spark Optimization
Starting a series on Spark optimization, covering the different levels of optimization.
Jul 8, 2019 • 
Junaid Effendi
Testing Data in Apache Spark
How to efficiently test large scale Spark jobs, using open source libraries.
Sep 9, 2023 • 
Junaid Effendi
Optimizing Spark Job (spark-submit/shell)
This is the second article in our series, on optimizing the Spark command. We usually use two types of Spark commands, spark-submit and spark-shell, both of…
Jul 20, 2019 • 
Junaid Effendi
Benchmarking Spark - Open Source vs EMRs
Diving into four approaches from Spark Operator to EMR (EKS, EC2, and Serverless), sharing benchmarking results and key insights to help you choose the…
Jul 5 • 
Junaid Effendi
Spark Performance Tuning with Ganglia and Sparklens
Leveraging Sparklens and Ganglia for optimizing Spark Jobs.
May 4, 2020 • 
Junaid Effendi
Optimizing Spark Query
This is the last article in our Spark Optimization series. Optimizing a Spark query is as challenging as it is interesting; as a Data Engineer, I love…
Aug 5, 2019 • 
Junaid Effendi
What's new in Spark 3.0?
As a Data Engineer, I wait for an improved Spark version every year, and last month they introduced a major, long-awaited upgrade known as Spark 3.0.
Nov 24, 2019 • 
Junaid Effendi
Common Issues faced in Spark
There are several issues everyone faces when they start using Spark, whether at work or for fun.
Apr 9, 2018 • 
Junaid Effendi
Optimizing Spark I/O
Reading and writing files is part of most Spark jobs, and to make it work well there are some approaches and techniques that should be…
Jul 27, 2019 • 
Junaid Effendi
Spark UDFs Are Cruel!
Spark UDFs (User Defined Functions) are not the best tool a developer can use. They look so cool, and the syntax for writing them is especially cool…
Aug 25, 2019 • 
Junaid Effendi
Apache Spark - Frequently Asked Questions
Developers and engineers are by now well aware of Apache Spark and its purpose in the technology stack, but there are still some basic questions…
Jan 27, 2019 • 
Junaid Effendi
Optimizing Spark Cluster
Learn the techniques to save cost and improve performance through cluster optimization.
Jul 15, 2019 • 
Junaid Effendi
© 2025 Junaid Effendi