Capabilities for Bioinformaticians| Curoverse

Arvados is a new generation of open source software infrastructure that addresses the most important challenges in production data science. Deployed in the cloud or on a cluster, it helps you manage massive datasets, run complex pipelines in production, reproduce your work, collaborate with colleagues, and publish your results.

Start Using Arvados Today

Start fast with a free cloud trial account managed by Curoverse or deploy it in your own cloud account.

Arvados is designed to run in large, elastic computing clusters, so you can deploy it in your own cluster.

An Arvados cluster can be setup on a single computer using docker for development and testing.

Work the Way You Want

Use your favorite languages

Arvados works with any programming language and includes SDKs for Python, Perl, Ruby, Java, and Go.

Choose command line, API, or browser

Interact with Arvados services through the command line, a REST API, or an intuitive web application.

Leverage standards

Arvados is on the cutting edge of new standards including implementations of the and

Access the tools you know

The Arvados community is porting common pipelines to the platform so you have quick access to best open source tools and pipelines.

Stay free and open source

Arvados is 100% open source so you will never be locked into a proprietary system or stuck in a black box.

Feel the Power

Scale computations

Arvados can scale parallel computations to thousands of nodes so you can handle any job without all the hassles of configuring an HPC cluster.

Handle massive data sets

Work with everything from terabytes to petabytes of data with great performance, fault tolerance, and automatic data integrity checking.

Match the compute to the job

The system automatically provisions compute resources matched to the needs of each job with the right amount of RAM, CPU, and runtime libraries.

Deploy applications

Launch web applications at the end of pipelines in your own virtual servers, so you can interact with your data.

Run in the cloud and on premise

Arvados can run in your data center, in the cloud, or in a hybrid configuration that lets you take advantage of the best of both.

Be More Productive

Stay organized

Easily put data, pipelines, computational runs, and results into projects that help you keep your work neatly organized.

Go from ad hoc to production

With an Arvados cluster, you get root access to a virtual private server to do your work. From there, it's simple to go to scaled production.

Easily reproduce any pipeline

Arvados tracks every pipeline you run and makes reliably reproducing any pipeline easy.

Manage, organize, and re-organize data

Using flexible data management tools, you can quickly create and re-organize datasets with anything from one file to a million, without copying.

Know the origin and use of data

Every dataset you generate in Arvados is tracked, so you can easily figure out where it came from and how it’s used.

Collaborate, Share & Publish

Share with colleagues

Easily and securely share datasets, pipeline templates, and pipeline runs with colleagues in your lab and organization.

Collaborate with your team

Use shared projects to keep everything together for collaboration with other bioinformaticians, developers, and researchers.

Publish to the world

Share your pipelines and sample data in a public project that anyone can access without logging in, so they can see your methods.

Stay secure

Flexible permission controls let you secure access to datasets and projects without worrying about all the hassles of traditional file/folder permissions.

Copy whole projects

Switch clusters or organizations and easily copy every aspect of your work including code, pipelines, Docker images, data, and metadata.

Learn More About the Technology