Networks and digital scholarship

Digital scholarship offers a wide variety of methodological and pedagogical approaches, including text and data mining, network analysis, and spatial analysis.

Among the more powerful desktop applications for network analysis such as Gephi, these software provide algorithms for interpreting networks and relationships as well as creating visualizations. Web approaches, to date, have not come with the computational power of desktop applications, but allow the easier creation of network visualizations. Networks also tend to be tricker to comprehend over maps, which entails a more intense methodological borrowing than we do with maps. That includes using algorithms designed by computer scientists for purposes that might be at odds of the digital scholar. As Scott Weingart once warned, “Networks can be used on any project. Networks should be used on far fewer.”

The aim of this workshop is to help introduce you to some of these approaches for making and using networks.

How this workshop will proceed

Our time together is limited and we cannot go too deeply into working with, obtaining, or preparing network data. My hope is to introduce you to the basics, provide a set of tools to demonstrate what is possible, and continue the conversation beyond this workshop.

Be aware, also, that far more time will be spent preparing and cleaning data than creating the visualizations themselves. Some argue that you’ll spend as much as 80% of your time preparing your data for visualization, while the remainder 20% goes to the work of visualization itself. Since we’re together for a limited time, we won’t have a chance to really experience the tedious, frustrating, and arduous work that can go into preparing network data. These are approaches that you will need to develop for your research, and skills that you will bring to your students. As you take these skills to the classroom, consider the kinds of things your students can complete and receive the most reward for effort.

That said, networks—through desktop software and web-based platforms—have made the tools of these visualizations easier for scholars and students to grapple with. Maps are often more readily understood than networks—because we frequently use maps as part of our day to day lives, their use for research and narrative is more apparent than networks which comes with an entire set of vocabulary, methods, theory, and approaches. The interactivity of maps allows scholars to filter data, to operate at different spatial scales, and embed sources into maps. Similarly, networks can move from large-scale relationships to explorations of smaller connections. These techniques, however, take time. Once you’ve begun to master them, consider how to bring these approaches and skills into your classroom and your research.

As noted above, the biggest challenge with creating networks is the data—namely, once the data is ready and you’ve gained some competendnce in making these visualizations, the process proceeds quickly. Far more of your time—and your student’s time—will be spent in preparing data, a task that’s highly variable to the visualization and evidence you are working with. In other words, the evidence you’re attempting to visualize comes with the very thing digital scholars spend their time on—the finding and interpretation of evidence.

(Figure adapted from Lincoln Mullen.)

The network toolbox


Gephi is an open-source network analysis and visualization platform developed by the University of Technology of Compiégne in France. Gephi is currently the most popular software package for creating network graphs, combining both various algorithmic approaches to analyzing networks with the visualization of the network itself.

Note that we may run into some problems with Gephi. It has a notorious problem with Java, and although the new version claims to be more stable, many have reported that isn’t the case (myself included.)

Previewing a network in Gephi.

Previewing a network in Gephi.


Cytoscape is an open-source software platform designed originally for scientists, largely used in biological research, but can graph any network of nodes and edges. Like Gephi, it has an ecosystem of plugins developed by others in the community that provide various alternative ways of visualizing networks in the platform.

Previewing a network with Cytoscape.

Previewing a network with Cytoscape.


Palladio, developed by the Humanities+Design lab at Stanford University, is designed for the easy visualization of humanities data as a map, graph, table, or gallery. Palladio’s network visualization capabilities allow for the visualization of a bivariate graph based on any two dimensions inside a dataset, that can then be filtered based on attributes in the data.

Previewing a network with Palladio.

Previewing a network with Palladio.

Programming languages

The most powerful method for creating networks come from programming languages such as R, Python, and Javascript. These languages allow you to control various algorithmic and aesthetic aspects of network visualizations. These include powerful libraries such as igraph and sne. We won’t be digging into any languages during this workshop, but there are many resources available for getting started with programmatic approaches to visualization. See the resources page for more.

Before this workshop

There are a few things to install for the workshop:

  • Gephi. A common platform for building network visualizations.
  • A plain text editor. For Mac I recommend TextWrangler. Please do not use TextEdit. For Windows, Notepad++.
  • A program for editing comma-separated values (CSV) files. Microsoft Excel does things to data that should never be done to data, so I recommend Libre Office as a more reliable alternative.

Additionally, an FTP/SFTP client for accessing websites might be useful but is optional. On Mac and Windows I recommend Cyberduck.