Cheatography https://cheatography.com

DL Cheat Sheet (DRAFT) by woobidoobi

This is a draft cheat sheet. It is a work in progress and is not finished yet.

Session

Interpreter	%jdbc(hive)	%livy2.pyspark
Create Session		from pyspark_llap import HiveWarehouseSession; hive = HiveWarehouseSession.session(spark).build()
Connect to database	use ampb_sandbox	hive.setDatabase("dl_prod_sandbox_agi")
List Databases		hive.execute("show databases").show(truncate=False)
List Tables	show tables	hive.execute("show tables").show(truncate=False)
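
The %livy2.pyspark column above can be wrapped into two small helpers. This is only a sketch: the function names open_hive_session and list_tables are made up for illustration, and it assumes the Hive Warehouse Connector (the pyspark_llap module) is installed on the cluster and that `spark` is the session Livy provides.

```python
def open_hive_session(spark, database):
    """Build a HiveWarehouseSession and point it at `database`.

    Hypothetical helper, not part of pyspark_llap itself.
    """
    # Imported lazily: pyspark_llap only exists where the
    # Hive Warehouse Connector is installed.
    from pyspark_llap import HiveWarehouseSession

    hive = HiveWarehouseSession.session(spark).build()
    hive.setDatabase(database)
    return hive


def list_tables(hive):
    """Return the result of "show tables" as a dataframe."""
    return hive.execute("show tables")
```

In a %livy2.pyspark paragraph this would be used as hive = open_hive_session(spark, "dl_prod_sandbox_agi") followed by list_tables(hive).show(truncate=False).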

Data wrangling

Function	SQL equivalent	pyspark df
Selecting columns	select arbetsgivaravgift_organisationsnummer_id as orgnr, arbetsgivaravgift_redovisad_period as period, arbetsgivaravgift_inbetalat_belopp as inbetalat_belopp from df_agavgift	df_agavgift.select( col("arbetsgivaravgift_organisationsnummer_id").alias("orgnr"), col("arbetsgivaravgift_redovisad_period").alias("period"), col("arbetsgivaravgift_inbetalat_belopp").alias("inbetalat_belopp") )

Livy2.pyspark

Transfer data from hive to pyspark dataframe	pyspark_df = hive.table("hive_table")
Register a pyspark dataframe as a temp view (queryable with spark.sql)	pyspark_df.createOrReplaceTempView("hive_table")
Remove a temp view	spark.catalog.dropTempView("hive_table")

Combining data

inner join	df1.join(df2, df1.name == df2.name)
left join	df1.join(df2, df1.name == df2.name, how='left')
right join	df1.join(df2, df1.name == df2.name, how='right')
joining on multiple columns	df1.join(df2, (df1.name == df2.name) & (df1.id == df2.id))  # id is an example second column

In pyspark, set an alias for each table before joining, so that columns with the same name in both tables can be told apart:
df1 = TableA.alias('df1')
df2 = TableB.alias('df2')

Useful links:
http://www.learnbymarketing.com/1100/pyspark-joins-by-example/
http://www.learnbymarketing.com/618/pyspark-rdd-basics-examples/
