CartoDB - A Quick Guide

Amla Srivastava (as5196)


Introduction

CartoDB is a Software as a Service (SaaS) cloud computing platform that provides GIS and web mapping tools for display in a web browser. This review covers some basic features, requirements, advantages and drawbacks of the tool. The motivation for this tool review was my Final EDAV Project 'NYC Motor Vehicle Collisions' where my team and I used CartoDB for creating several maps.

CARTO is offered as a freemium service. This makes it easy for beginners/first-timers to get a flavor of the tool. CARTO basically offers two services:

  • CARTO Builder: allows users to manage data, run user side analysis and design custom maps; intended for non developers and beginners

  • CARTO Engine: a set of APIs and developer libraries for building custom map and data visualization interfaces for advanced users

Once the data has been uploaded, the user can create Maps using the data by selecting the 'New Map' option. There are several map types which can be used. The main chart types are shown here for the NYC Emergency Response Incidents dataset for the years 2011-2016. The dataset is available here.

*The maps here have been created purely for demonstration purposes and do not necessarily reflect the best way to plot the data.

Basic maps

The maps below show very emergency respnse incident in NYC that occured between 2011 and 2016 with yellow circles. In the case on the left, overlapping points are not blended whereas in the case on the right overlapping points are blended to make areas of higher incident count appear progresively darker. The map on the left also has pop-ups that appear on hovering and which display information about the borough and type of incident.

Animated maps

The maps below plot the individual emergency incidents between 2011-16 as a function of time. In the first map, all incidents are represented the same color whereas in the second map incident are colored by incident-type.

Area aggregation

The map below shows emergenecy response incidents aggregated by area where area is represented by hexbins/squares. The darker the color, the greater the no. of incidents in that area and vice versa.

Heatmap

The map below shows emergenecy response incidents over time in the form of a heatmap.

Advantages and Drawbacks

Carto offers several advantages when it comes to location-based data visualization such as:

  • Ease of use: It supports several data formats such as CSV, ZIP, SHAPEFILE, GeoJSON, TopoJSON, XLS and so on. Data can be uploaded easily from your local machine or cloud with a simple drag and drop operation. Once your dataset is connected you can make changes to your data, data types as well as select data using either the GUI or PostgreSQL. The GUI is very intuitive and user-friendly. CartoDB provides access to a number of in-built datasets. Unlike R, the tool does not require any specific skills except a basic background knowedge of map types and data pre-processing. It also has functionality to geocode data from country names/ postal codes and so on in case location information is not explicitly given in the data. Compared to R, it is very easy to map customized maps since it does not require any coding skills.

  • Several chart types: Carto offers multiple charting options, both static and dynamic. Data can be displayed as individual data points or aggregated as squares or hexbins and in heatmaps. Visualizations can also be animated by displaying data as a function of time. More advanced chart types are available in the paid versions.

  • Customization: Carto has several options to customize visualizations using different color schemes, sizes, and strokes. One can also add pop-up labels and customized legends to better explain the data. Search boxes, zoom controls and layer selectors can be eaily integrated. Several types of analysis such as clustering, filtering, trend analysis, etc. can also be done. Carto maps work on a layered model, such that information can be plotted on consecutive layers. Charts are also visually appealing.

  • Easy of sharing: Carto maps support one-click publishing to the web and are much easier to embed and share as compared to graphs created in R.

However, as with any data visualization tool there are certain challenges while using it.

  • Speed: For larger datasets, uploading data takes a very long time. Updates/changes to the database and maps for such data are also very slow and cumbersome.

  • Limited flexibility: Despite several options for customization, Carto lacks the flexibility offered by other visualization tools since it addresses a very specific domain.

  • Price: Carto services come in three flavors - Free, Personal and Enterprise. At $149/year, the Personal package is expensive if the user only occassionaly needs to plot map-based data.


Resources

Some basic tutorials for CartoDB are available here and the complete documentation is available at this link.