R is a free software environment for statistical computing and graphics. It has strong capabilities and can be highly extensible with object-oriented features. It is a very powerful visualization and statistical analysis tool used in BigData for the following purposes

  • Visualization ( Graphs, Charts, etc) using packages such as ggplot2 and/or some inbuilt functions like plot() ,etc.
  • Data Wrangling for cleaning the data to extract useful information.
  • EDA (Exploratory Data Analysis) – Insights from Data
  • Parallel / Cluster computation using Apache Spark (SparkR).

Categorized in: