Spark
SESSION 3
The Partitioned Primary Index (PPI) feature allows a class of queries to read only the relevant portion of a large table instead of scanning the whole table.
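The idea behind that sentence can be illustrated with a small sketch. This is not how any database implements PPI internally; it is a toy in-memory model, assuming rows are grouped into monthly partitions by a date column. The class name `PartitionedTable`, the column names `sale_date` and `amount`, and the monthly partitioning scheme are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from datetime import date


def partition_key(d):
    """Map a date to its (year, month) partition. Hypothetical scheme."""
    return (d.year, d.month)


class PartitionedTable:
    """Toy table that stores rows grouped by a monthly partition key."""

    def __init__(self):
        self.partitions = defaultdict(list)

    def insert(self, row):
        self.partitions[partition_key(row["sale_date"])].append(row)

    def query_range(self, start, end):
        """Return rows with start <= sale_date <= end and the number of
        partitions that had to be read to find them."""
        lo, hi = partition_key(start), partition_key(end)
        scanned = 0
        result = []
        for key, rows in self.partitions.items():
            # Skip every partition that cannot contain matching rows.
            if lo <= key <= hi:
                scanned += 1
                result.extend(
                    r for r in rows if start <= r["sale_date"] <= end
                )
        return result, scanned


# One row per month of 2023, so the table has 12 partitions.
table = PartitionedTable()
for month in range(1, 13):
    table.insert({"sale_date": date(2023, month, 15), "amount": month * 10})

# A two-month range query only needs to read 2 of the 12 partitions.
rows, partitions_scanned = table.query_range(date(2023, 3, 1), date(2023, 4, 30))
print(partitions_scanned, "of", len(table.partitions), "partitions read")
```

The range predicate on the partitioning column is what makes skipping possible; a query that filters only on other columns would still have to read every partition in this sketch.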