Spark

HOME
SPARK
RDD
- RDD Basic
- Empty RDD
- Pair RDD
- Transformation
- Actions
- RDD Example
DataFrame
DataFrame Join
Basic Commands
Spark Sql
Optimization
Interview

SESSION 3

The Partitioned Primary Index (PPI) feature allows a class of queries to access a portion of a large table insted of the whole table.

Email This BlogThis!Share to X Share to Facebook Share to Pinterest

No comments:

Post a Comment

Subscribe to: Posts (Atom)

Followers

Popular Posts

Teradata Utility Error Handling - Multiload

The Teradata Multiload utility provides the capability to perform batch maintenance on tables (insert, update, delete and upsert). It loads...
Teradata Parallel Transporter

Teradata Parallel Transporter is the preferred load/unload tool for the Teradata Database. Parallel Transporter is able to run all the bul...
Dynamically manage Hive Table in spark

Method to Drop Table and Files object test { def main(args: Array[String]): Unit = { val databaseName = args(0) val tableName ...
Install Apache Spark on Windows

Prerequisites This guide assumes that you are using Windows 10 and the user had admin permissions. System requirements: Windows 10 OS At l...

Links

Unix
BigData
Hive
AWS
MongoDB
Python
Scala
Snowflack
Sqoop
Teradata

Theme images by Maliketh. Powered by Blogger.