We have hosted the application sparklyr in order to run this application in our online workstations with Wine or directly.


Quick description about sparklyr:

sparklyr is an R package that provides seamless interfacing with Apache Spark clusters�either local or remote�while letting users write code in familiar R paradigms. It supplies a dplyr-compatible backend, Spark machine learning pipelines, SQL integration, and I/O utilities to manipulate and analyze large datasets distributed across cluster environments.

Features:
  • Connects to Spark via YARN, Mesos, Kubernetes, Livy or local mode
  • Enables dplyr-style data transformation on Spark DataFrames
  • Supports SQL queries and ML pipelines (ml_* API)
  • Includes tools for distributed computing, window functions, streaming
  • Extensible with packages like sparkxgb, graphframes, H2O
  • Handles reading/writing CSV, Parquet, JSON, and caching operations


Programming Language: R.
Categories:
Data Management

Page navigation:

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.