How the DTIC Thesaurus and Metatagger Application Support the Indexing Process at DTIC
URI / Handle
The Defense Technical Information Center (DTIC) has served the research and information needs of the Defense community for more than 70 years since the end of WW2. DTIC has developed a number of tools to help manage this vast repository of information of over 4 million records, including the DTIC Thesaurus and Metatagger application. The DTIC Thesaurus is a controlled vocabulary of approximately 18,000 terms organized in hierarchical format along with a Thesaurus lexical evidence file of approximately 65,000 more granular terms tied to these thesaurus terms. These terms are also tied to a broader Subject Field and Group taxonomy dating back decades. The DTIC Thesaurus controlled vocabulary feeds into DTIC’s Metatagger application, which semantically analyzes documents and then outputs ranked tag listings of concepts and topics summarizing the main themes of that document. DTIC’s Content Analysts use these suggested metadata terms for indexing documents coming into DTIC’s repository.