Edge + Digital Scholarship Events (Libcal)
Participants in this session will acquire a general understanding of topic modeling, the automated analysis technique often referred to as "text mining." Topic modeling can refer to a number of different algorithms, which are computationally intensive and mathematically complex. To facilitate a hands-on approach with a focus on process, this workshop uses the open-source MALLET toolkit as a platform for exploring topic modeling with LDA (Latent Dirichlet Allocation) and will not offer a comparison of algorithms. In addition to topic modeling, this session introduces the concepts of sequence labeling and automated document classification, both of which are also possible with MALLET.
This workshop focuses on the technical dimensions of corpus development. Using an array of printed matter -- from digital facsimiles of incunabula to modern letterpress/offset books -- we will explore the risks and benefits of optical character recognition (OCR); file formatting and naming issues; organization strategies for large corpora; and problems of data cleaning and preparation. We will also look at some common sources for textual research data, such as Project Gutenberg, the Internet Archive, and Google Books. While this session will not examine legal issues in detail, we will discuss some common legal concerns around the use of textual corpora.
Who are the intended users of your digital publication? How can you reach new audiences and keep your existing audiences actively engaged? We'll learn about some of the ways successful projects connect with their users and promote their work to potential audiences. We’ll also consider how to effectively and ethically involve and credit audience involvement in one’s research and do a quick overview of some annotation tools that foster this kind of engagement (e.g., VideoAnt, StoryMap, Genius, Hypothes.is). Participants will leave this session with a solid grounding in the ethical and logistical dimensions of engaging audiences and incorporating audience involvement into their own publication practices.
Format: This two-hour workshop is meant to promote and engage discussion around students’ specific digital publishing concerns. Consequently, attendance is capped at 15 students, and participants will be asked to share their specific interests and needs ahead of time, to help ensure that presentation examples and discussion points are sufficiently relevant. We will provide numerous examples (projects and tools) to help illustrate key points.
This workshop will provide an overview of common options for publishing sound and video on the Web, focusing on the benefits of various platforms, licensing and rights issues, accessibility issues to consider, and methods of integrating multiple media into research publications. Platforms and tools will include Vimeo, YouTube, SoundCloud, and, for presenting materials in an interactive timeline, Sway and Tableau story feature. Participants will be able to match their digital research with appropriate platforms for public dissemination and will realize the strengths, limitations, and legal issues of various platforms.
Format: This two-hour workshop is meant to promote and engage discussion around students’ specific digital publishing concerns. Consequently, attendance is capped at 15 students, and participants will be asked to share their specific interests and needs ahead of time, to help ensure that presentation examples and discussion points are sufficiently relevant. Sessions will provide numerous examples (projects and tools) to help illustrate key points.
Humanists work with various media, content and materials (sources) as part of their research. These sources can be considered data. This workshop will introduce data management practices for humanities researchers to consider and apply throughout the research lifecycle. Good data management practices pertaining to planning, organization, documentation, storage and backup, sharing, citation, and preservation will be presented through a humanities lens with discipline-based, concrete examples. While general good data management practices are relevant across disciplines, participants working specifically within the humanities are the intended audience for this workshop.
Kick off the new school year with us at our upcoming meeting on Wednesday, September 26th, from 5:30-7pm. We'll be reading selections from award-winning novelist and essayist Roxane Gay's Difficult Women, her debut collection of short fiction.
Although we'll plan to discuss "I Will Follow You," "Difficult Women" and "North Country," you should feel free to read as much or as little (we are low-maintenance, after all) of the work as you'd like.
We are also featuring a giveaway--the first ten people to RSVP for the meeting will receive a free copy of the book!
Mellon's Monograph Initiative and the Reassembling of Scholarly Communications [Munch & Mull Digital Scholarship Discussion Group]
Monographs, long the apex of scholarly accomplishment in the humanities, have faced a crisis in recent years -- from diminishing markets for publishing these scholarly works, to insufficient technologies for capturing and disseminating the range of digitally inflected scholarship emerging from the humanities. The Andrew W. Mellon Foundation's Scholarly Communications Initiative awarded several grants (2014-2015) to encourage innovation and capacity-building in scholarly monograph publishing. "Reassembling Scholarly Communications," a 2017 report on this endeavor published by members of the Canadian Institute for Studies in Publishing in The Journal of Electronic Publishing (volume 20, issue 1), uses final reports from those sponsored projects, conversations with project grantees, and a critical examination of these alongside discourse of the Mellon Foundation, to raise questions and highlight assumptions that underlie this initiative and its results. As part of The Edge's 2017-2018 series, Re: Publishing, this M&M discussion will examine these assumptions and the report findings, consider the implications for humanities publishing, and pose questions for ourselves about how long-form scholarship can be rewarded and sustained.
Thomas More's _Dialogue of Comfort_ Gets Translated Yet Again [Munch & Mull Digital Scholarship Discussion Group]
Mark DeLong (Director, Duke Research Computing) did some "digital humanities" back in 1986 with code developed using Borland's "TurboPascal" and data compiled by counting lines in an Everyman's Library Edition, derived from the 1557 edition of Thomas More's Dialogue of Comfort Against Tribulation. Following established digital humanities practices of the 1980s, DeLong misplaced both the Pascal code and the data he compiled. He did get a couple of pages in his dissertation out of the effort, which he now says was "not a good trade, given the amount of time the data compilation took."
Since 1986, More's work has been made available online (https://www.gutenberg.org/ebooks/17075), and Mark has learned Perl and some Python. In order to redeem himself from the horrors of bad data practices, he's returned to the project and hopes to deepen his now ancient scholarship on More's Dialogue of Comfort. At this Munch & Mull discussion, Mark will outline his current progress and opine about his transition from classic scholar to grey-haired, bow-tied digital hipster, perhaps shedding some light on a direction for (at least his) digital humanities work.