David S. Dubin
School of Information Sciences
University of Illinois at Urbana-Champaign
Champaign, IL 61820
Phone: 217-244-3275
WWW: https://ddubin.web.illinois.edu/ddubin.html
- Ph.D. Information Science, 1996, University of Pittsburgh.
- M.S. Library and Information Science, 1990, Drexel University,
Philadelphia, PA.
- B.S. Humanities and Communications, 1988, Drexel
University.
- University of Illinois at Urbana-Champaign, School of
Information Sciences
- Research Associate Professor, Since August 2007.
- University of Illinois at Urbana-Champaign, Graduate School of
Library and Information Science, Information Systems
Research Laboratory
- Senior Research Scientist, August 2001 - August 2007.
Specialist in electronic publishing, information retrieval,
data analysis, and visualization. Teaching areas: data analysis
and information processing.
- University of Illinois at Urbana-Champaign, Graduate School of
Library and Information Science
- Assistant Professor, September 1996 - August 2001.
Teaching areas: information organization and access,
information storage and retrieval, data structures, library
automation, document processing, networked information systems,
research methods.
- University of Pittsburgh, Department of Information
Science
- Teaching Fellow, September 1995 - August 1996, January 1995 - May 1995,
January 1994 - September 1994, May 1991 - August
1993.
Teaching areas: information retrieval, online searching,
behavioral models, human information processing, systems
analysis.
- University of Pittsburgh, Department of Library Science
- Teaching Fellow, May 1995 - September 1995.
Teaching area: microcomputer applications.
- University of Pittsburgh, Department of Information
Science
- Visiting Lecturer, September 1994 - December 1994.
Teaching area: information storage and retrieval.
- dh Molde College, Molde, Norway
- Research Fellow in Computing Science, August 1993 - December 1993.
Teaching area: human information processing.
- University of Pittsburgh, Department of Library Science
- Adjunct Instructor, September 1992 - December 1992.
Teaching area: microcomputer applications.
- University of Pittsburgh, Department of Information
Science
- Graduate Student Assistant, September 1990 - May 1991.
- Drexel University, Philadelphia, PA, College of Information
Studies
- Graduate Assistant, September 1988 - June 1990.
- Delaware Valley Transplant Program, Philadelphia, PA
- Cooperative Education Intern (technical communication), November 1984 - June 1987.
- Larry S. Jackson (Library and Information Science,
University of Illinois at Urbana-Champaign, 2009) Website
Structure.
- Karen M. Wickett (Library and Information Science,
University of Illinois at Urbana-Champaign, 2012)
Collection/Item Metadata Relationships.
- Jin Ha Lee (Library and Information Science, University of
Illinois at Urbana-Champaign, 2008) Analysis of Information
Features in Natural Language Queries for Music Information Retrieval:
Use Patterns and Accuracy.
- Jun Wang (Library and Information Science, University of Illinois
at Urbana-Champaign, 2006) Computational Approaches to Linguistic
Consensus.
- Bei Yu (Library and Information Science, University of
Illinois at Urbana-Champaign, 2006) An Evaluation of Text
Classification Methods for Literary Study.
- Qin He (Library and Information Science, University of Illinois
at Urbana-Champaign, 2000) Component Study of Co-Word
Analysis.
- Jianhua Dong (Library and Information Science, University
of Illinois at Urbana-Champaign, 2000) Combination of Multiple
Web Search Results and its Effect on the Search Performance.
- Damien Guillaume (Astronomy, University of Strasbourg, 2000)
Distributed Information Retrieval, Search and Processing in
Astronomy.
- Co-Chair (with Bridget Almas and Sayeed Choudhury) Research
Data Provenance Interest Group, Research Data Alliance, September 2013
- present.
- Co-Chair (with Carolyn Anderson), Annual Meeting of the
Classification Society of North America, June 2007.
- Board of Directors, Classification Society of North America,
2002-2004.
- Editorial Board, Classification Literature Automated Search
Service, Classification Society of North America.
- Local Organizing Committee, Ninth Biennial Meeting of the
International Federation of Classification Societies,
July 2004.
- Program Committee, Annual Meeting of the Classification Society
of North America, June 1998.
- Demonstrations Chair, Second ACM International Conference on
Digital Libraries, July 1997.
- Local Arrangements Committee, Annual Meeting of the
Classification Society of North America, June 1993.
- Registration Chair, Annual International ACM SIGIR Conference on
Research and Development in Information Retrieval, June
1993.
- Monica Berti, Bridget Almas, David Dubin,
Greta Franzini, Simona Stoyanova, and Gregory R. Crane.
The linked fragment: TEI and the
encoding of text reuses of lost authors.
Journal of the Text Encoding Initiative, (8), 2015.
- David Dubin and Jacob Jett.
An ontological framework
for describing games.
In Proceedings of the 2015 Joint Conference on Digital Libraries,
New York, 2015.
- David Dubin, Megan Senseney, and
Jacob Jett.
What it is vs. how we
shall: complementary agendas for data models and architectures.
In Proceedings of Balisage: The Markup Conference 2013, volume 10
of Balisage Series on Markup Technologies, Montréal, Canada,
August 2013.
- Andrea K. Thomer, Karen S. Baker, Simone
Sacchi, and David Dubin.
Completeness, coverage &
equivalence in scientific data records.
Proceedings of the American Society for Information Science and
Technology, 49, 2012.
- Karen M. Wickett, Simone Sacchi, David
Dubin, and Allen H. Renear.
Identifying content and
levels of representation in scientific data.
Proceedings of the American Society for Information Science and
Technology, 49, 2012.
- David Dubin, Karen M. Wickett, and
Simone Sacchi.
Content, format, and
interpretation.
In Proceedings of Balisage: The Markup Conference 2011, volume 7
of Balisage Series on Markup Technologies, Montréal, Canada,
August 2011.
- Simone Sacchi, Karen Wickett, Allen
Renear, and David Dubin.
A framework for
applying the concept of significant properties to datasets.
In Proceedings of the American Society for Information Science and
Technology, volume 48 of ASIS&T Annual Meeting
Proceedings. American Society for Information Science and Technology,
2011.
- David Dubin.
Encoded descriptions at face
value.
In Andrew Grove, editor, Proceedings of the 73rd Annual Meeting of the
American Society for Information Science and Technology, volume 47 of
ASIS&T Annual Meeting Proceedings, Pittsburgh, PA, October
2010. Information Today, Inc.
- Allen H. Renear, Karen M. Wickett,
Richard J. Urban, David Dubin, and Sarah L. Shreeves.
Collection/item metadata
relationships.
In Jane Greenberg and Wolfgang Klas, editors, Proceedings of the
International Conference on Dublin Core and Metadata Applications,
Berlin, pages 80-89, Goettingen, September 2008. Dublin Core Metadata
Initiative, Goettingen University Press.
- Allen H. Renear and David Dubin.
Three of the four FRBR group 1
entity types are roles, not types.
In Andrew Grove, editor, Proceedings of the 70th Annual Meeting of the
American Society for Information Science and Technology, Medford, NJ,
2007. Information Today, Inc.
- David Dubin, Joe Futrelle, and Joel
Plutchak.
Metadata enrichment for digital
preservation.
In B. T Usdin, editor, Proceedings of Extreme Markup Languages
2006, Montreal, Quebec, August 2006.
- David Dubin and David Birnbaum.
Interpretation beyond markup.
In B. T Usdin, editor, Proceedings of Extreme Markup Languages
2004, Montreal, Quebec, August 2004.
- David Dubin.
Object mapping for markup
semantics.
In B. T Usdin, editor, Proceedings of Extreme Markup Languages
2003, Montreal, Quebec, August 2003.
- David Dubin, C. M. Sperberg-McQueen, Allen
Renear, and Claus Huitfeldt.
A logic
programming environment for document semantics and inference.
Literary and Linguistic Computing, 18(2):225-233, 2003.
(This is a corrected version of an article that appeared in 18:1 pp. 39-47).
- Jonghoon Lee and David Dubin.
Vocabulary mapping
in the NASA ADS: Prospects for practical subject access.
In B. Corbin, E. Bryson, and M. Wolf, editors, Library and Information
Services in Astronomy IV, pages 249-256, Washington, DC, 2003. U.S.
Naval Observatory.
- Allen Renear and David Dubin.
Towards identity conditions for
digital documents.
In S. Sutton, editor, Proceedings of the 2003 Dublin Core
Conference, Seattle, WA, October 2003. University of Washington.
- Allen Renear, Christopher Phillippe, Pat
Lawton, and David Dubin.
An XML document corresponds to
which FRBR Group 1 entity?.
In B. T Usdin, editor, Proceedings of Extreme Markup Languages
2003, Montreal, Quebec, August 2003.
- Allen Renear, David Dubin, C. M.
Sperberg-McQueen, and Claus Huitfeldt.
XML semantics and
digital libraries.
In C. C. Marshall, G. Henry, and L. Delcambre, editors, Proceedings of
the third ACM/IEEE-CS joint conference on Digital libraries, pages
303 - 305, Los Alamitos, CA, 2003. IEEE.
- C. M. Sperberg-McQueen, David Dubin,
Claus Huitfeldt, and Allen Renear.
Drawing inferences on the basis of
markup.
In B. T Usdin and S. R. Newcomb, editors, Proceedings of Extreme Markup
Languages 2002, Montreal, Quebec, August 2002.
- Allen Renear, David Dubin, C. M.
Sperberg-McQueen, and Claus Huitfeldt.
Towards a semantics for
XML markup.
In R. Furuta, J. I. Maletic, and E. Munson, editors, Proceedings of the
2002 ACM Symposium on Document Engineering, pages 119-126, McLean,
VA, November 2002. Association for Computing Machinery.
- Jonghoon Lee, David S. Dubin, and
Michael J. Kurtz.
Co-occurrence
evidence for subject vocabulary reconciliation in ADS databases.
In D. M. Mehringer, R. L. Plante, and D. A. Roberts, editors,
Astronomical Data Analysis Software and Systems VIII, volume 172
of A.S.P. Conference Series, pages 287-290, San Francisco, 1999.
Astronomical Society of the Pacific.
- Jonghoon Lee and David Dubin.
Context-sensitive vocabulary
mapping with a spreading activation network.
In M. Hearst, F. Gey, and R. Tong, editors, Proceedings of the 1999 ACM
SIGIR Conference on Research and Development in Information Retrieval,
pages 198-205, New York, 1999. Association for Computing Machinery, ACM.
- David S. Dubin.
Addressing the
heterogeneity of subject indexing in the ADS databases.
In U. Grothkopf, H. Andernach, S. Stevens-Rayburn, and M. Gomez, editors,
Library and Information Services in Astronomy III, volume 153
of A.S.P. Conference Series, pages 77-83, San Francisco, 1998.
Astronomical Society of the Pacific.
- D. Dubin.
The search for structure and the search for meaning.
In R. Schwartz, editor, Advances in Classification Research Volume
6, pages 13-20. Information Today, Inc., Medford, NJ, 1998.
- R. R. Korfhage, D. Dubin, and E. M.
Housman.
Computer-aided interactive classification: applications of VIBE.
In P. Solomon, editor, Advances in Classification Research Volume
7, pages 83-101. Information Today, Inc., Medford, NJ, 1998.
- R. R. Korfhage, D. Dubin, and E. M.
Housman.
What good is visualization: three
experiments.
In R. F. Erbacher and E. Pang, editors, Visual Data Exploration and
Analysis V, pages 196-207, Bellingham, WA, 1998. SPIE.
- D. Dubin, B. H. Kwasnik, and
C. Tangmanee.
Elicitation techniques for classification research.
In R. Fidel, C. Beghtol, B. H. Kwasnik, and P. J. Smith, editors,
Advances in Classification Research Volume 5, pages 33-68.
Information Today, Inc., Medford, NJ, 1996.
- D. S. Dubin.
Attribute selection for visualizing multidimensional document spaces: a
progress report.
In D. S. Ebert, editor, Workshop on New Paradigms in Information
Visualization and Manipulation (NPIV '96), pages 8-11, 1996.
- D. Dubin.
Document analysis for
visualization.
In Proceedings of the Annual International ACM SIGIR Conference on
Research and Development in Information Retrieval, pages 199-204, New
York, 1995. ACM SIGIR, Association for Computing Machinery.
- D. Dubin.
Applying similarity measures to texts.
TEXT Technology, 4(4):283-291, 1994.
- K. A. Olsen and D. Dubin.
Maintaining a personal reference library with a word processor and a scanner.
TEXT Technology, 4(2):149-154, 1994.
- I. Y. Song and D. Dubin.
An intensional query processor in prolog.
In M. H. Hamza, editor, Computer Applications in Design, Simulation, and
Analysis, pages 204-207, Anaheim, CA, 1991. ACTA Press.
- D. Dubin, J. Futrelle, J. Plutchak, and
J. Eke.
Preserving meaning, not just
objects: semantics and digital preservation.
Library Trends, 57(3):595-610, 2009.
- Allen H. Renear and David Dubin.
FRBR as
an interdisciplinary high-middle range theory.
In Angela De Cenzo, editor, Proceedings of iConference 2008, Los
Angeles, 2008.
- D. Dubin.
The most influential paper Gerard
Salton never wrote.
Library Trends, 52(4):748-764, 2004.
- D. S. Dubin, A. Renear, and C. M.
Sperberg-McQueen.
Addressing
obstacles to the retrieval of structured documents.
Technical Report UIUCLIS- -2003/1+EPRG, Graduate School of Library and
Information Science, University of Illinois at Urbana-Champaign, Champaign,
IL, 2003.
- D. Dubin.
Standards and information.
In J. R. Schement, editor, Encyclopedia of Communication and
Information, volume 3, pages 965-967. Macmillan, New York, 2002.
- D. Dubin.
Toward
more robust discrimination-based indexing models.
Technical Report UIUCLIS- -1999/7+IRG, Graduate School of Library and
Information Science, University of Illinois at Urbana-Champaign, Champaign,
IL, 1999.
- D. Dubin.
Dimensions and discriminability: The role of controlled vocabulary in
visualizing document associations.
In P. A. Cochrane and E. H. Johnson, editors, Visualizing Subject Access
for 21st Century Information Resources, pages 39-44. University of
Illinois Graduate School of Library and Information Science, Champaign, IL,
1998.
- D. Dubin.
Further
cautions for the calculation of discrimination values.
Technical Report UIUCLIS- -1999/3+IRG, Graduate School of Library and
Information Science, University of Illinois at Urbana-Champaign, Champaign,
IL, 1998.
- D. Dubin.
Measurement in information science (book review).
Journal of Classification, 14(2):327-330, 1997.
- D. Dubin.
Structure in Document Browsing
Spaces.
PhD thesis, University of Pittsburgh, 1996.
- D. Dubin.
Multimedia and imaging databases (book review).
Information Processing and Management, 32(6):769-770, 1996.
- D. Dubin.
Search strategies for Internet resources.
School Library Media Quarterly, 24(1):53-54, 1995.
- M. B. Spring and D. Dubin.
Hands-on PostScript.
Hayden Books, Carmel, IN, 1992.
(Published in Polish translation by Intersoftland of Warsaw as PostScript od A do Z).
- D. Dubin.
Online databases put a new universe of resources at our fingertips.
TIES Magazine, pages 38-43, Sept./Oct. 1989.
- Bridget Almas, Monica Berti, Sayeed
Choudhury, David Dubin, Megan Senseney, and Karen M. Wickett.
Representing
humanities research data using complementary provenance models.
Presented at Building Global Partnerships: Research Data Alliance Second
Plenary Meeting, Washington, DC, September 2013.
- David J. Birnbaum, David Dubin, and
Cynthia Vakareliyska.
Clustering calendars of saints: discriminatory power and variable selection.
Presented at the annual meeting of the Classification Society, Milwaukee, WI,
June 2013.
- Karen Wickett, David Dubin, Bridget
Almas, and Megan Senseney.
Extending the systematic assertion model for humanities research.
Presented at the 2013 ASIS&T Research Data Access and Preservation Summit,
April 2013.
- David Dubin.
On data identity and scientific equivalence.
Presented at the annual meeting of the Classification Society, Pittsburgh,
PA, June 2012.
- David Dubin.
Internal
cohesion and external separation.
Presented at the 23rd Annual ASIS&T SIG/CR Classification Research Workshop,
October 2012.
- Carole L. Palmer, Tiffany C. Chao,
Nicholas M. Weber, Simone Sacchi, Karen M. Wickett, Allen H. Renear, Karen
Baker, Andrea Thomer, and David Dubin.
Integrating conceptual and empirical
studies of data to guide curatorial processes.
Presented at the 2012 ASIS&T Research Data Access and Preservation Summit,
March/April 2012.
- K. M. Wickett, S. Sacchi,
D. Dubin, and A. H. Renear.
Representing
identity and equivalence for scientific data.
Presented at the American Geophysical Union (AGU) Fall Meeting, December
2012.
- Karen M. Wickett, Andrea Thomer, Simone
Sacchi, Karen S. Baker, and David Dubin.
What dataset descriptions actually
describe: Using the systematic assertion model to connect theory and
practice.
Presented at the 2012 ASIS&T Research Data Access and Preservation Summit,
March/April 2012.
- D. Dubin.
Data theory and scientific data management.
Presented at the annual meeting of the Classification Society, Pittsburgh,
PA, June 2011.
- Simone Sacchi, Allen Renear, and David
Dubin.
One thing missing or two things
are confused: An analysis of OAIS representation information.
Presented at the International Digital Curation Conference (IDCC), December
2011.
- D. Dubin.
On the expressive content of games.
Presented at the 2009 Joint Conference of the National Popular Culture and
American Culture Associations, New Orleans, April 2009.
- D. Dubin.
Challenges for board game classification.
Presented at the 2008 ALISE Annual Conference, Philadelphia, PA., January
2008.
- David Dubin and David J. Birnbaum.
Reconsidering conventional markup for knowledge representation.
Presented at Balisage: the Markup Conference, Montreal., August 2008.
- Allen H. Renear, Richard J. Urban,
Karen M. Wickett, Carole L. Palmer, and David Dubin.
Sustaining collection value: Managing
collection/item metadata relationships.
Presented at Digital Humanities 2008, Oulu, Finland, June 2008.
- Allen H. Renear, Karen M. Wickett,
Richard J. Urban, and David Dubin.
The return of the trivial:
problems formalizing collection/item metadata relationships.
In JCDL '08: Proceedings of the 8th ACM/IEEE-CS joint conference on
Digital libraries, pages 464--464, New York, NY, USA, 2008.
Association for Computing Machinery, ACM.
- Allen Renear, David Dubin, and Karen
Wickett.
When digital objects change --- exactly what changes?
Presented at the 2008 Annual Meeting of the American Society for Information
Science and Technology, Columbus, OH, October 2008.
- D. Dubin.
Instance or expression? another look at reification.
Presented at Extreme Markup Languages 2007, Montreal., August 2007.
- D. Dubin.
Reframing author cocitation analysis.
Presented at the tenth biennial meeting of the International Federation of
Classification Societies, Ljubljana, Slovenia, July 2006.
(also presented at the 2006 meeting of the Classification Society of North
America, Piscataway, NJ.
- Claus Huitfeldt, Michael
Sperberg-McQueen, David Dubin, and Lars G. Johnsen.
Markup languages for complex documents: an interim project report.
Presented at Digital Humanities, Paris, July 2006.
- J. S. Downie, A. Renear, A. Mathes,
K. Medina, D. Dubin, and J. H. Lee.
Modelling complex multimedia
relationships in the humanities computing context: Are Dublin Core and
FRBR up to the task?.
Presented at ALLC/ACH, Victoria, British Columbia, June 2005.
- D. Dubin and D. J. Birnbaum.
A declarative framework for modeling
pronunciation and rhyme.
Presented at ALLC/ACH, Victoria, British Columbia, June 2005.
- D. Dubin.
Unpacking the interpretation of METS markup.
Presented at the Digital Library Federation Fall Forum, Charlottesville, VA,
November 2005.
- D. Birnbaum and D. Dubin.
Measuring similarity in the contents of medieval miscellany manuscripts.
Presented at the ninth biennial meeting of the International Federation of
Classification Societies, Chicago, IL, July 2004.
- D. Dubin.
Semantic markup or markup semantics?
Presented at the University of Pittsburgh School of Information Sciences
Colloquium Series, March 2004.
- D. Dubin and J. Lee.
Beyond three dichotomies.
Presented at the annual meeting of the Classification Society of North America,
Tallahassee, FL, June 2003.
- D. Dubin, G. Ripoche, and L. Gasser.
Organizational dynamics of software development.
Presented at the DIMACS Workshop on Algorithms for Multidimensional Scaling,
Tallahassee, FL, June 2003.
- C. M. Sperberg-McQueen, A. Renear,
C. Huitfeldt, and D. Dubin.
Skeletons in the closet: Saying what markup means.
Presented at ALLC/ACH, Tübingen, Germany, July 2002.
- D. Dubin.
Do conceptual spaces have metric structure?
Presented at the annual meeting of the Classification Society of North America,
Madison, WI, June 2002.
- D. Dubin.
Model, data, and system centered perspectives in information retrieval
research.
Presented at the annual meeting of the Classification Society of North America,
St. Louis, MO, June 2001.
- D. Dubin and M. Rorvig.
Classical two-way metric MDS adapted to handle very large datasets.
Presented at the DIMACS Workshop on Algorithms for Multidimensional Scaling,
Rutgers University, Piscataway, NJ, August 2001.
- D. Dubin and D. X. Pape.
Clustering applications and validation: A case study with the Kohonen SOM.
Presented at the annual meeting of the Classification Society of North America,
Montreal, Canada, June 2000.
- D. Dubin and D. X. Pape.
Validation strategies for large-scale clustering applications.
Presented at the annual meeting of the Classification Society of North America,
Pittsburgh, PA, June 1999.
- D. Dubin.
Challenges for the future of document clustering.
Presented at the annual meeting of the Classification Society of North America,
Washington, DC, June 1997.
(Also presented at the 1998 meeting of the International Federation of
Classifcation Societies, Rome, Italy).
- D. Dubin.
Clustering tendency and the cluster hypothesis in information retrieval.
Presented at the annual meeting of the Classification Society of North America,
Amherst, MA, June 1996.
- D. Dubin.
POI discovery and the clarity of VIBE displays.
Presented at the annual meeting of the Classification Society of North America,
Houston, TX, June 1994.