Skip to main content
U.S. flag

An official website of the United States government

Return to search results
💡 Advanced Search Tip

Search by organization or tag to find related datasets

Agricultural Research Word Vectors

Published by Agricultural Research Service | Department of Agriculture | Metadata Last Checked: June 24, 2025 | Last Modified: 2024-02-15
<p>This model was originally trained for use in a recommendation system to the Ag Data Commons that will automatically link viewers of one dataset to other directly relevant datasets and research papers that they may be interested in. It was also used to determine the similarities and differences between projects within ARS’ National Programs and create a visualization layer to allow leaders to explore and manage their programs easily.</p> <p>This model was generated using the Word2Vec model, starting with a set of <a href="https://drive.google.com/file/d/0B7XkCwpI5KDYNlNUTTlSS21pQmM/edit">word vectors trained on Google News articles</a>, and further training it on the titles+abstracts from <a href="https://pubag.nal.usda.gov/apidocs">PubAg</a> and the titles+descriptions from <a href="https://data.nal.usda.gov/data.json">Ag Data Commons</a>. This model was trained using a vector length of 300 and the Continuous Bag of Words version of the algorithm with negative sampling.</p> <p>This word vector model could be used for any Natural-Language Processing applications involving text with a large amount of agricultural research vocabulary. </p><div><br>Resources in this dataset:</div><br><ul><li><p>Resource Title: Agricultural Word Vectors.</p> <p>File Name: AgWordVectors-300.zip</p><p>Resource Description: Word vectors trained on the full titles/abstracts in PubAg and titles/abstracts in Ag Data Commons. (Part A)</p></li><br><li><p>Resource Title: Agricultural Word Vectors Trainables.</p> <p>File Name: AgWordVectors-300.model<em>.trainables.syn1neg.zip</em></p><p><em>Resource Description: Word vectors trained on the full titles/abstracts in PubAg and titles/abstracts in Ag Data Commons. (Part B)</em></p></li><em><br></em><li><em><p>Resource Title: Agricultural Word Vector Model.</p> </em><p><em>File Name: AgWordVectors-300.model</em>.wv_.vectors.zip</p><p>Resource Description: Word vectors trained on the full titles/abstracts in PubAg and titles/abstracts in Ag Data Commons. (Part C)</p></li></ul><p></p>

Find Related Datasets

Click any tag below to search for similar datasets

Complete Metadata

data.gov

An official website of the GSA's Technology Transformation Services

Looking for U.S. government information and services?
Visit USA.gov