largecats' blog
data engineer
Home
Archives
Categories
Tags
About
Categories
blog
2019-06-20 Thu.
Solving Gitalk validation error with status code 500
blog
life-saver
gitalk
2019-06-17 Mon.
Inserting images in blog post
blog
life-saver
markdown
2019-06-17 Mon.
Building blog with GitHub Pages, Jekyll, and Gitalk
blog
github-pages
jekyll
gitalk
life-saver
2019-10-29 Tue.
(Py)Spark UDF Caveats
life-saver
work
pyspark
spark
2019-09-22 Sun.
Run C/C++ program from Windows Subsystem for Linux in Visual Studio Code
life-saver
wsl
C/C++
vs-code
2019-08-19 Mon.
Installing Hadoop, Spark, and Hive in Windows Subsystem for Linux (WSL)
life-saver
work
hadoop
spark
hive
wsl
2019-08-17 Sat.
Setting up Jupyter Notebook kernel for Scala, Python to use Spark
life-saver
work
spark
scala
python
2019-07-31 Wed.
Setting up PySpark in Jupyter Notebook
life-saver
work
spark
pyspark
2019-06-20 Thu.
Solving Gitalk validation error with status code 500
blog
life-saver
gitalk
2019-06-18 Tue.
Solving Qt plugin error when calling matplotlib.pyplot
life-saver
python
2019-06-17 Mon.
Inserting images in blog post
blog
life-saver
markdown
2019-06-17 Mon.
Deleting large folders on Windows
life-saver
dos
fun
2019-07-30 Tue.
Turning addresses into coordinates via Google Map API
fun
python
2019-06-20 Thu.
OCR with comics
fun
ocr
python
2019-06-19 Wed.
Text analysis with movie reviews
fun
text-analysis
python
nltk
sentiment-analysis
2019-06-18 Tue.
Scraping movie information: IMDb vs. Douban
fun
web-scraping
regular-expression
python
html
work
2021-06-10 Thu.
Stream Processing 101
work
flink
2021-03-30 Tue.
Spark Data Pipeline Framework Management
work
spark
python
scala
2021-03-30 Tue.
sbt Multi-Project Dependency
work
spark
sbt
scala
2021-03-07 Sun.
Spark Magnet: Push-based Shuffle
work
spark
2021-01-02 Sat.
Spark Partitions
work
spark
2020-12-01 Tue.
Variances in Type Systems
work
scala
2020-10-17 Sat.
Caching in Spark
work
spark
YARN
2020-10-09 Fri.
Solving Spark timeout errors
work
spark
2020-09-21 Mon.
Collecting Log in Spark Cluster Mode
work
spark
YARN
shell-scripting
2019-12-06 Fri.
Caching and Unpersisting Pyspark RDD
work
pyspark
2019-10-29 Tue.
(Py)Spark UDF Caveats
life-saver
work
pyspark
spark
2019-08-19 Mon.
Installing Hadoop, Spark, and Hive in Windows Subsystem for Linux (WSL)
life-saver
work
hadoop
spark
hive
wsl
2019-08-17 Sat.
Setting up Jupyter Notebook kernel for Scala, Python to use Spark
life-saver
work
spark
scala
python
2019-07-31 Wed.
Building Scala project with and without sbt
work
scala
2019-07-31 Wed.
Installing Scala on Windows
work
scala
2019-07-31 Wed.
Setting up PySpark in Jupyter Notebook
life-saver
work
spark
pyspark
2019-07-31 Wed.
Installing Spark on Windows
work
spark
pyspark
scala
computer-systems
2019-11-18 Mon.
Two's Complement
computer-systems
computer-systems
Content
blog (3)
life-saver (9)
fun (4)
work (17)
computer-systems (1)