Automatically assembling a full census of an academic field

04/08/2018
by   Allison C. Morgan, et al.
0

The composition of the scientific workforce shapes the direction of scientific research, directly through the selection of questions to investigate, and indirectly through its influence on the training of future scientists. In most fields, however, complete census information is difficult to obtain, complicating efforts to study workforce dynamics and the effects of policy. This is particularly true in computer science, which lacks a single, all-encompassing directory or professional organization. A full census of computer science would serve many purposes, not the least of which is a better understanding of the trends and causes of unequal representation in computing. Previous academic census efforts have relied on narrow or biased samples, or on professional society membership rolls. A full census can be constructed directly from online departmental faculty directories, but doing so by hand is prohibitively expensive and time-consuming. Here, we introduce a topical web crawler for automating the collection of faculty information from web-based department rosters, and demonstrate the resulting system on the 205 PhD-granting computer science departments in the U.S. and Canada. This method constructs a complete census of the field within a few minutes, and achieves over 99 census to a hand-curated 2011 census to quantify turnover and retention in computer science, in general and for female faculty in particular, demonstrating the types of analysis made possible by automated census construction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/24/2018

CSIndexbr: Exploring the Brazilian Scientific Production in Computer Science

CSIndexbr is a web-based system that provides meaningful,open,and transp...
research
05/03/2022

Why The Trans Programmer?

Through online anecdotal evidence and online communities, there is an in...
research
08/15/2006

Tarski's influence on computer science

The influence of Alfred Tarski on computer science was indirect but sign...
research
06/16/2022

Industrial Limitations on Academic Freedom in Computer Science

The field of computer science is perhaps uniquely connected with industr...
research
07/06/2021

Visions in Theoretical Computer Science: A Report on the TCS Visioning Workshop 2020

Theoretical computer science (TCS) is a subdiscipline of computer scienc...
research
03/12/2017

Research Methods in Computer Science: The Challenges and Issues

Research methods are essential parts in conducting any research project....
research
02/07/2023

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study

Due to the exponential growth of scientific publications on the Web, the...

Please sign up or login with your details

Forgot password? Click here to reset