Welcome to the Methods Commons

Methodica is a collection of research methods and techniques for analyzing text. Computation has produced new and exciting ways of studying text in the Digital Humanities, and many of these methods do not require the use of expensive programs or detailed programming knowledge. This site describes common or interesting sequences of actions, or recipes, showing users how to combine freely accessible resources to perform various analytic tasks.

Creating a Basic Web Scraper with Python

This utility is for creating a simple web scraper with Python.

Concordance – Plain Text (TAPoRware)

Concordance (Plain Text) is a free, web-based tool designed to run in a browser window. It is easy to use, designed to locate and contextualize a user-specified word or pattern (in this case, a regular expression) within a document, either hosted at a web address or uploaded from the user's files.

Fixed Phrase – HTML (TAPoRware)

Fixed Phrase (HTML) is a free, web-based tool designed to run in a browser window. It is easy to use, designed to search for two words or patterns within a user-specified distance apart and show all instances matching the specifications within a document, either hosted at a web address or uploaded from the user's files.

XTRACT

XTRACT was a lexical collocation tool developed by Frank Smadja in the early 1990s that used statistical techniques for retrieving and identifying collocations in a large textual corpora (Smadja, "XTRACT" 399).

Pages