petastorm online with Winfy

We have hosted the application petastorm in order to run this application in our online workstations with Wine or directly.


Quick description about petastorm:

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. Petastorm is an open-source data access library developed at Uber ATG. This library enables single machine or distributed training and evaluation of deep learning models directly from datasets in Apache Parquet format. Petastorm supports popular Python-based machine learning (ML) frameworks such as Tensorflow, PyTorch, and PySpark. It can also be used from pure Python code. A dataset created using Petastorm is stored in Apache Parquet format. On top of a Parquet schema, petastorm also stores higher-level schema information that makes multidimensional arrays into a native part of a petastorm dataset. Petastorm supports extensible data codecs. These enable a user to use one of the standard data compressions (jpeg, png) or implement her own.

Features:
  • Selective column readout
  • Open source data access library
  • Multiple parallelism strategies: thread, process, single-threaded (for debug)
  • Plain Python API
  • Row filtering (row predicates)
  • Partitioning for multi-GPU training


Programming Language: Python.
Categories:
Libraries, Machine Learning

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.