DDSS holds its own workshops during the academic year. These events will be announced through our mailing list and will be posted here. Ideas for additional workshops, including advanced technical training by external sources, can be submitted to [email protected] for consideration.
The DDSS Coffee Hour is a dedicated forum for informal discussion and interdisciplinary networking for those working in advanced quantitative or computational social science.
Graduate students, faculty, and staff across disciplines are invited to learn more about the DDSS, discover the areas of technical specialization offered by our fellows, seek informal consultation, highlight particular needs, and discuss new ideas.
Each meeting opens with a presentation from our Graduate Fellows of Social Data Science on the methodological and technical aspects of their work. We then move to general discussion and unstructured conversations. Fellows are joined by the DDSS Director (Rocío Titiunik), Director of Research (Lori Bougher), and DDSS technical staff.
DDSS coordinates theme-based meetings so researchers at all levels (graduate students, research specialists, postdocs, faculty, etc.) can meet others working in these technical niche areas.
The aim is to share knowledge across disciplines, avoid duplicate efforts made in isolation, and identify areas through which DDSS may best support these efforts (e.g., providing a repository for crowdsourced code, coordinating advanced training, or developing relevant tools).
These groups offer an informal space to delve deeper into particular subareas, share new packages, introduce new methodologies, and develop a theme-based technical network of support.
Initial groups will include:
- Topics in large data management and processing
- Record linkage techniques
- Geospatial data
If you are interested in joining these groups or would like to propose an additional group, please reach out to Eric Manning, our Postdoctoral Research Associate, at [email protected]
The Princeton R Group, sponsored by DDSS, will host a series of monthly R lunch-n-learns, lightning talks, and invited speakers. Details for forthcoming events will be added to this page. Please contact DDSS Graduate Fellow Kim Kreiss or Angela Li to learn more about the group or propose topics for future sessions.
This workshop covers topics for advanced geospatial analysis in R, with an emphasis on incorporating analyses into existing R workflows without the need for bindings to standalone GIS software. Covered material will include spatial density and concentration analysis, spatial autocorrelation, identification of local clusters or outliers, and…
This workshop introduces researchers to tools for large geospatial data processing in R and SQL. Before researchers can conduct statistical analyses, they often first need to standardize and geolocate a list of addresses, then perform one or more common operations, such as spatial joins, identifying and aggregating covariates from nearest…
No content available to show.
No content available to show.
This workshop briefly introduces participants to several R tools for record linkage. We then discuss Splink, an increasingly popular Python package for record linkage and deduplication. Following a brief (re)introduction to the underlying probabilistic framework, a code-along will demonstrate an example use case, highlighting the package’s…
Researchers often need to merge or deduplicate datasets that lack common identifiers for unique individuals or entity names. A researcher’s choice of solution to this problem can often affect downstream analyses, and researchers may also wish to incorporate uncertainty from this stage into their analyses. In this workshop, we introduce record…
Are you looking to share your Python code? This workshop will show participants the best practices for making a compatible and installable Python package. Participants will work through a set of hands-on exercises that cover the various steps required to publish a package on the Python Package Index (PyPI).
The Princeton R Group will be hosting its final event of the semester with a set of lightning talks! These are informal talks where people across policy and the social sciences can come showcase an interesting application of R for their work. Speakers will present for 10-15 minutes and take questions at the end of their talk. The goal is to…
We show how applied researchers can estimate differences between groups in terms of those groups' "understandings" of terms. We present inferential machinery for this, along with a convenient R package. We show how these ideas can be extended to non-English languages.
*Lunch will be provided!* Please RSVP below…
Many researchers are interested in conducting statistical analysis of large-scale data contained in scanned books or other documents. While OCR technology is increasingly capable of producing text versions of scanned documents, its unstructured (and often typo-ridden) products can be challenging to analyze. This talk will discuss how to…
This lunch-n-learn workshop is the first in a series of workshops hosted by tigeRs, the Princeton R Group, during the Fall 2023 semester.
ChatGPT and generative AI tools offer an opportunity for researchers to do their work more effectively, but social scientists, researchers, and policy analysts aren't always sure where to…
The ability of GPT-3 and its "cousins" to generate texts for a broad range of tasks–from summarization to question-answering and information retrieval—via an intuitive natural language interface—may hold great promise for social science research. Already, researchers are using GPT-3 for tasks ranging from optical character recognition (OCR)…
This two-part workshop goes from a gentle introduction to key concepts in deep learning to the mechanics of advanced architectures used in the cutting edge of research. In Part 1, we will cover the core building blocks of neural networks models, and gain an understanding of the key intuitions behind using neural networks to model data.
Creating engaging and accessible data visualizations is an essential tool for policy and research communication. We will discuss the main concepts of the grammar of graphics and use applied examples to explore ggplot2’s ability to create multi-layered and complex graphs. The workshop provides a short overview of the basics of ggplot2 for data…