[XML4Lib] mods: the new marc?
Kyle Fenton
jfenton at emory.edu
Mon Dec 17 13:40:22 EST 2007
Hi Eric,
If you choose MODS you'd have as company the Digital Library
Federation, who have developed a MODS profile for its Aquifer project:
http://wiki.dlib.indiana.edu/confluence/display/DLFAquifer/DLF+Aquifer+Public+Metadata+Documents
If you want to store extended information that MODS does not cover,
you might consider wrapping MODS inside one or another of the METS
profiles already available.
--
Kyle Fenton
Emory University Libraries
P.S. -- you may be interested in the current email threads on NGC4LIB
regarding next generation cataloging issues (esp. FRBR)
On Dec 16, 2007, at 4:34 PM, Eric Lease Morgan wrote:
>
> Is MODS the new MARC?
>
> As you may or may not know, I advocate "catalogs" include content
> beyond the things a library owns or licenses. Moreover, I advocate
> libraries take a more active role in collecting and providing
> services against information resources no matter where they reside
> on a network. Don't get me wrong, I don't advocating "cataloging"
> the entire Internet, but I do advocate actively collecting materials
> apropos to the needs of a particular library's patrons.
>
> In an effort to demonstrate such an idea I would like to collect and
> provide services against a number of different types of data/
> information freely available on the 'Net. Some of these things
> include but are not to the following listed in no priority order:
> electronic books/texts (Project Gutenberg, University of Michigan
> MBooks, Open Content Alliance, etc.), electronic journals from DOAJ,
> electronic journal articles from DOAJ Articles, pre-prints and post-
> prints from various OAI repositories, mailing list messages,
> selected blog postings, theses & dissertations from NDLTD, etc.
>
> Each of the things above can be systematically harvested through the
> use of OAI, simple Web crawling, or the retrieval of data sets. Once
> harvested the data could be stored in a database and/or indexed
> providing the means for discovery and services. The storage of this
> content in a database begs questions regarding tables, records, and
> fields. What might they be? Similarly, unless the index is going to
> be 100% free text, the harvest content/metadata will need to mapped
> to fields. Again, what fields?
>
> I'm not so naive to believe there is such a thing a the perfect
> database structure for this "catalog", nor do I believe free text
> indexing is the answer either. So, what sort of data structure
> should I use? Not MARC. MODS? Some incarnation of RDF?
>
> If I go this route I see the following plan:
>
> 0. Articulate a collection policy.
> 1. Acquire/harvest content in its raw form.
> 2. Convert the raw content into MODS, RDF, or
> something else.
> 3. Save/archive the raw data because things get lost
> in translation.
> 4. Save the MODS or RDF to a (XML) database.
> 5. Parse the MODS or RDF and save it to a
> (relational) database.
> 6. Run scripts against the database to create things
> like browsable lists, create new relationships
> between items, or simply enhanced.
> 7. Index the MODS or RDF, or write a report against
> the database intended for indexing.
> 8. Provide access to the index (via SRU, OpenSearch,
> or Z39.50).
> 9. Provide services against the search results such
> as Get It, Review It, Buy It, Bookmark It, Compare It
> To Other things, etc.
> 10. Got to Step #1.
>
> Assuming there is no single database structure for such a idea, what
> flavor of XML would you use as your canonical data format? MODS?
> RDF? Something else?
>
> --
> Eric Lease Morgan
> University Libraries of Notre Dame
>
> _______________________________________________
> XML4Lib mailing list
> XML4Lib at webjunction.org
> http://lists.webjunction.org/mailman/listinfo/xml4lib
>
>
More information about the XML4Lib
mailing list