A pipeline for the extraction of information about biomaterials from PubMed abstracts into a MongoDB collection