For anyone who has storage issues or is not in a position to afford a system compatible to their requirements for data science work, Google Colab has been a blessing.
Working with Colab has opened up so many avenues for me that I thought weren’t possible before. We no longer have the restriction of poor computational power on our machines. And Free GPUs are at our fingertips.
But, as a data scientists it is important to know all the alternatives available for a particular tool. So in this article, we will be explring some of the widely used alternatives to google colab.
1- What is Google Colaboratory?
2- Alternatives of Google Colab
3- Other Alternatives
Google Colaboratory or Google Colab is a free Jupyter Notebook environment. It is a free cloud-based service by Google which means you don’t have to pay anything. One of the best things about Colab is that you don’t need to install anything beforehand. In fact, many of the Data Science and Machine Learning libraries such as Pandas, NumPy, Tensorflow, Keras, OpenCV come pre-installed with Colab.
The notebooks you create are saved on your Google Drive. So Colab also leverages the collaboration features of Google Docs, where you can share your notebook with multiple people easily and all of you can work on the same notebook at the same time without any issue.
Google also provides the use of a free NVIDIA Tesla K80 GPU. If you connect Colab to Google Drive, that will give you up to 15 GB of disk space for storing your datasets. You can run the session in an interactive Colab Notebook for 12 hours, which is enough for a beginner. Google has its self-made custom chips called TPUs.
One more thing to keep in mind is that the dataset you upload in the Colab notebook gets deleted once the session is ended.
However, you can upgrade to the Pro version, which supposedly gives you access to faster GPUs – NVIDIA TESLA T4 or P100 GPUs, longer runtimes up to 24 hours, and more RAM.
In the following section, we discuss the top 5 alternatives to google colab.
Amazon SageMaker is also a cloud-based Machine Learning platform developed by Amazon in November 2017. It provides hosted Jupyter notebooks that require no setup. But it is not free. Yes, you have to pay for its services, though the trial is free(for the initial two months).
“Using SageMaker Studio is free, you only pay for the AWS services that you use within Studio.”
Here is the guide on how to use SageMaker and its features.
CoCalc or Collaborative Calculation is a web-based cloud computing (SaaS) and course management platform for computational mathematics. It is an open-source software hosted by SageMath Inc. The creator and lead developer of CoCalc are William Stein, a former professor of mathematics at the University of Washington. Along with the Jupyter notebook it supports editing of Sage worksheets and LaTeX documents.
You can get started with CoCalc from here.
Kaggle is a popular platform for its Data Science Competitions, however, they also provide free Kernels or Notebooks for performing all the Machine Learning and Data Science tasks, independent of the competitions. Kaggle Kernels is a free platform to run Jupyter notebooks in the browser. Both Colab and Kaggle are the product of Google and have many similarities.
Kaggle has updated its kernels to have more computation power and memory. 20GB Dataset, 5GB Disk Space, 9 hours run time, and 4 CPUs w/ 16GB RAM or when the GPU is turned on it is 2 CPU cores w/ 13GB RAM.
Binder is powered by BinderHub, which is an open-source tool that deploys the Binder service in the cloud. Binder allows you to create custom computing environments that can be shared and used by many remote users. It allows you to input the URL of any public Git repository, and it will open that repository within the native Jupyter Notebook interface. You can run any notebooks in the repository, though any changes you make will not be saved back to the repository.
It can be helpful when you have a repository full-on Jupyter Notebooks. Although there is a user limit of 100 users for a repository(which is enough I guess).
Some of the other alternatives which I didn’t mention in this article can be-
1- Saturn Cloud
3- Datalore
Do check them out.
The purpose of this article was just to give an idea about the possible alternatives of Google Collaboratory, the end decision is up to you which one would you prefer according to your need. I hope you will explore all of these platforms and identify the pros and cons for your line of work.
Also do let me know which platform do you prefer/use and why.
Nice post! Maybe one can also mention Databricks Community Edition (free, https://community.cloud.databricks.com). Quite handy. And naturally dealing with python, R, SQL, Scala, Java. Supporting Spark. But no gpu's in the free edition, you need to upgrade for that.
Thanks Philippe, and thanks for telling about one more alternative. Appreciated!
a=34 if a % 2 == 0: print("The number is even") else: print ("The number is odd") print("Here it is anyway")