Skip to Main Content

Manage Your Research Data: Resources

Several resources and tools are available to facilitate the text mining process, each offering unique features and capabilities. Here are some valuable text mining resources available for you:

All Elsevier journals and books are able to be text data mined. The Elsevier API enables researchers to bulk download the content they would like to analyse allowing greater efficiency and consistency in the process. Find out more information about the API and links to the developer portal where you may download the API for non commercial research on the Elsevier text and data mining page. The API may also be used to mine metadata such as titles and abstracts indexed in the Scopus database.


 

Text mining within PubMed has been made possible with the availability of many free tools that make seeing the interactions and links between terms within research easy. Enabling the researcher to gain insights and see patterns in their area of research.

PubVenn

Generates a Venn diagram and list of articles based on search terms and articles listed in PubMed.

PubReMiner

Ranks the frequency of words and terms found in abstracts and titles for articles indexed in PubMed in a table display. Other frequency displays include the journals and authors most associated with the terms. You are also able to lookup human gene names.

Coremine Medical

Requires a free account to use. Results are displayed on a dashboard showing relationships in a graphic network. 

MeSH on demand

Enter free text to highlight and identify MeSH terms, PubMed articles that are identified as being similar to the text will also be displayed.


 

VOSviewer

Visualise patterns and relationships within a bibliometric network. Featuring the ability to form networks based on authors, journals and other bibliometric details as well as text mining for the relationships between terms.

The resources mentioned above are subscribed to by ECU. However, it's important to note that not all resources permit text and data mining (TDM). Some may require you to obtain permission before proceeding, and publishers might also charge a fee for text mining their content.

Voyant tools

Input includes HTML, .txt, .pdf, and Word documents, no programing experience is required. Voyant analyses your text and provides an analysis in a dashboard format with interactivity between the various tools. This free web based tool is a text reading and analysis environment, enabling visualisation and interpretation of text for scholarly purposes.

The getting started in Voyant guide

Text Analyser

Drag and drop documents into the analyser provided by JSTOR, with several types of formats accepted for input. Analyses the text for key terms and topics and suggest related articles from JSTOR.

Lexos

A text preparation and analysis tool enabling you to clean, scrub and section your text before analysis. You are then able to perform cluster analysis and visualisations from your data.

Constellate, the new text and data analytics service from JSTOR and Portico is a platform for learning and performing text analysis, building datasets, and sharing analytics course materials.

Goldstone-Underwood Stoplist

A list of stop words which may be adapted for your project. Stop words are words that contain little useful information in an analysis that you wish your analysis to ignore.