Data Set Representation and Tagging for Automating Data Cataloging