data
Arrow
Package description
Apache Arrow (incl. PyArrow Python bindings), a cross-language development platform for in-memory data.
Use latest version
Use specific version
BeautifulSoup
https://www.crummy.com/software/BeautifulSoup
Package description
Beautiful Soup is a Python library designed for quick turnaround projects like screen-scraping.
Use latest version
Use specific version
DBD-mysql
https://metacpan.org/pod/distribution/DBD-mysql/lib/DBD/mysql.pm
Package description
Perl binding for MySQL
Use latest version
Use specific version
DB_File
https://perldoc.perl.org/DB_File.html
Package description
Perl5 access to Berkeley DB version 1.x.
Use latest version
Use specific version
GDAL
Package description
GDAL is a translator library for raster geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation. As a library, it presents a single abstract data model to the calling application for all supported formats. It also comes with a variety of useful commandline utilities for data translation and processing.
Use latest version
Use specific version
HDF
https://support.hdfgroup.org/products/hdf4/
Package description
HDF (also known as HDF4) is a library and multi-object file format for storing and managing data between machines.
Use latest version
Use specific version
HDF5
https://portal.hdfgroup.org/display/support
Package description
HDF5 is a data model, library, and file format for storing and managing data. It supports an unlimited variety of datatypes, and is designed for flexible and efficient I/O and for high volume and complex data.
Use latest version
Use specific version
Hydra
Package description
Hydra is an open-source Python framework that simplifies the development of research and other complex applications. The key feature is the ability to dynamically create a hierarchical configuration by composition and override it through config files and the command line. The name Hydra comes from its ability to run multiple similar jobs - much like a Hydra with multiple heads.
Use latest version
Use specific version
Jansson
https://www.digip.org/jansson/
Package description
Jansson is a C library for encoding, decoding and manipulating JSON data. Its main features and design principles are: * Simple and intuitive API and data model * Comprehensive documentation * No dependencies on other libraries * Full Unicode support (UTF-8) * Extensive test suite
Use latest version
Use specific version
LAME
Package description
LAME is a high quality MPEG Audio Layer III (MP3) encoder licensed under the LGPL.
Use latest version
Use specific version
MariaDB
Package description
MariaDB is an enhanced, drop-in replacement for MySQL. Included engines: myISAM, Aria, InnoDB, RocksDB, TokuDB, OQGraph, Mroonga.
Use latest version
Use specific version
PnetCDF
https://parallel-netcdf.github.io/
Package description
Parallel netCDF: A Parallel I/O Library for NetCDF File Access
Use latest version
Use specific version
PostgreSQL
Package description
PostgreSQL is a powerful, open source object-relational database system. It is fully ACID compliant, has full support for foreign keys, joins, views, triggers, and stored procedures (in multiple languages). It includes most SQL:2008 data types, including INTEGER, NUMERIC, BOOLEAN, CHAR, VARCHAR, DATE, INTERVAL, and TIMESTAMP. It also supports storage of binary large objects, including pictures, sounds, or video. It has native programming interfaces for C/C++, Java, .Net, Perl, Python, Ruby, Tcl, ODBC, among others, and exceptional documentation.
Use latest version
Use specific version
PyTables
Package description
PyTables is a package for managing hierarchical datasets and designed to efficiently and easily cope with extremely large amounts of data. PyTables is built on top of the HDF5 library, using the Python language and the NumPy package. It features an object-oriented interface that, combined with C extensions for the performance-critical parts of the code (generated using Cython), makes it a fast, yet extremely easy to use tool for interactively browsing, processing and searching very large amounts of data. One important feature of PyTables is that it optimizes memory and disk resources so that data takes much less space (specially if on-flight compression is used) than other solutions such as relational or object oriented databases.
Use latest version
Use specific version
SUMO
Package description
Simulation of Urban MObility" (SUMO) is an open source, highly portable, microscopic and continuous traffic simulation package designed to handle large networks. It allows for intermodal simulation including pedestrians and comes with a large set of tools for scenario creation.
Use latest version
Use specific version
XML-LibXML
https://metacpan.org/pod/distribution/XML-LibXML/LibXML.pod
Package description
Perl binding for libxml2
Use latest version
Use specific version
dask
Package description
Dask natively scales Python. Dask provides advanced parallelism for analytics, enabling performance at scale for the tools you love.
Use latest version
Use specific version
dill
https://pypi.org/project/dill/
Package description
dill extends python's pickle module for serializing and de-serializing python objects to the majority of the built-in python types. Serialization is the process of converting an object to a byte stream, and the inverse of which is converting a byte stream back to on python object hierarchy.
Use latest version
Use specific version
h5py
Package description
HDF5 for Python (h5py) is a general-purpose Python interface to the Hierarchical Data Format library, version 5. HDF5 is a versatile, mature scientific software library designed for the fast, flexible storage of enormous amounts of data.
Use latest version
Use specific version
netCDF
https://www.unidata.ucar.edu/software/netcdf/
Package description
NetCDF (network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.
Use latest version
Use specific version
netCDF-Fortran
https://www.unidata.ucar.edu/software/netcdf/
Package description
NetCDF (network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.
Use latest version
Use specific version
pugixml
Package description
pugixml is a light-weight C++ XML processing library
Use latest version
Use specific version
pycocotools
https://pypi.org/project/pycocotools
Package description
Official APIs for the MS-COCO dataset
Use latest version
Use specific version
scikit-learn
https://scikit-learn.org/stable/index.html
Package description
Scikit-learn integrates machine learning algorithms in the tightly-knit scientific Python world, building upon numpy, scipy, and matplotlib. As a machine-learning module, it provides versatile tools for data mining and analysis in any field of science and engineering. It strives to be simple and efficient, accessible to everybody, and reusable in various contexts.