Session III: Special Topics in Large Data Processing

Date
Nov 18, 2024, 4:30 pm – 6:00 pm

Details

Event Description

This series introduces SQL for social scientists, with a focus on DuckDB, a portable, fast, and scalable database for analytics. With only a few basics, researchers can scale projects to handle most larger-than-memory data processing tasks without needing a cluster. We will discuss how to integrate DuckDB's SQL dialect into existing R (and Python) workflows. We conclude with special topics, which may include geospatial data processing, databases on the cluster, and some SQL best practices.

Sponsor
Data-Driven Social Science Initiative