WiseWithData
  • About Us
    • Our Story
    • Our Clients
    • Newsroom
    • Our Partners
    • Our Privacy Policy
    • Terms Of Service
  • Solutions
    • Automated SAS To Python PySpark Migration
    • Automated SAS To Databricks Migration
    • SPROCKET Runtime For PySpark
    • Apache Spark Consulting
  • People
    • Recruiting
    • Students & Internships
  • Discover
    • Spark Café Blog
    • SAS Migration FAQ
    • The White Papers
    • Resources
  • Access
    • Customer Portal
    • Partner Portal
  • Contact Us
Select Page

RDDs vs DataFrames vs DataSets: The Three Data Structures of Spark

by Mike Sun | May 20, 2020 | Apache Spark, Apache Spark Cafe, Java, Python, R, Scala

RDD, DataFrame, and Dataset are the three most common data structures in Spark, and they make processing very large data easy and convenient. Because of the lazy evaluation algorithm of Spark, these data structures are not executed right way during creations,...

The Rise in Popularity of Apache Spark

by Mike Sun | May 20, 2020 | Apache Spark, Apache Spark Cafe

Since year end 2014, there has been an increase in the number of Google searches comparing Apache Spark to Hadoop. What brings people who are experts in Big Data, Data Science, and Data Analysis to Apache Spark (Spark)? Spark is a fast and expressive cluster computing...

About Us

  • Our Story
  • Our Clients
  • Newsroom
  • Our Partners
  • Our Privacy Policy
  • Terms Of Service

Solutions

  • Automated SAS To Python PySpark Migration
  • Automated SAS To Databricks Migration
  • SPROCKET Runtime For PySpark
  • Apache Spark Consulting

People

  • Recruiting
  • Students & Internships

Discover

  • Spark Café Blog
  • SAS Migration FAQ
  • The White Papers
  • Resources

Access

  • Customer Portal
  • Partner Portal

Contact Us

  • Contact Us
© 2015-2025 Wise With Data Inc.