Arvados
An open source platform for managing, processing, and sharing genomic and other large scientific and biomedical data
With Arvados, bioinformaticians run and scale compute-intensive workflows, developers create biomedical applications, and IT administrators manage large compute and storage resources.
Flexibility
Run anywhere — in the cloud on AWS, Azure and GCP, as well as on premise and hybrid clusters. Scale compute resources dynamically, as needed.
Scale
Manage petabytes of data and run workflows that use thousands of cores of compute simultaneously.
Freedom
Use 100% free and open software that you can control. Software that is developed in public, backed by both strong commercial support and an active community.
Confidence
Be confident in your results. Easily identify the origin and verify the content of every dataset. Track every CWL workflow you run, reliably reproducing any output.
Decentralization
Run on distributed data using federated multi-cluster workflows. Federation brings compute to the data, allowing you to work on sequestered data and avoid expensive data transfers.
Security
Collaborate on projects by selectively and securely sharing your data and workflows. Share with the world by publishing your results. Comply with data protection regulations.
Try
Take a look at the public Arvados playground, a free-to-use installation of Arvados for evaluation and trial use.
Install
Learn how to install Arvados from the installation page in the documentation. Arvados supports AWS, GCP and Azure cloud platforms as well as on-premises installs.
Participate
Join the Arvados community channel for live discussion, Arvados forum for community support, or one of our regular video chats. Check out the development wiki for getting started as a contributor.