[XML4Lib] Re: tei2marcxml

Michelle Dalmau mdalmau at indiana.edu
Fri Jun 19 15:30:17 EDT 2009


Hi Eric,

To add to Kevin's message, at IU are in the process of implementing a  
basic MARC-to-TEI Header extraction based on a MARC Extraction style  
sheet developed by UIUC.  This MARC-to-TEI Header process is part of a  
larger Java program that generates a "TEI Shell" for our encoding  
projects.  Jenn Riley, our Metadata Librarian, and David Jiao, E-Text  
Programmer, are the masterminds behind this, but I am in the midst of  
testing our new workflow and will be documenting the process on our  
publicly accessible wiki page:
<https://wiki.dlib.indiana.edu/confluence/x/OQvpHQ>.  In the meantime,  
you can access the MARC extraction style sheet and our "template TEI"  
style sheet (and other style sheets) that make this happen.

In our MARC to TEI Header mapping work, we referenced the new Best  
Practices Kevin speaks of to guide this work although the precise  
mappings are still a work in progress.  I along with Kevin, Jenn and  
others have been involved in the planning and authoring work so we  
used some of our insider knowledge to get going with our MARC  
extraction.

Our first project guinea pig is actually a P4 TEI project for which we  
have a general Header template that details which bits are internal  
boilerplate, MARC derived and encoder generated.  If that would help  
you, I can generate a PDF of that documentation (it's password  
protected on our wiki).  However, this larger Java program will work  
with P4 pr P5 versions of the TEI and with both MARC-based and non- 
MARC-based metadata.

--Michelle

On Jun 19, 2009, at 12:15 PM, Kevin Hawkins wrote:

> Eric,
>
> You probably know this, but you can't write a generic script to  
> convert TEI headers to MARCXML because the TEI header is so flexible  
> and open to interpretation in its use.  So you really need something  
> project- or collection-specific that takes into account local  
> metadata practices.
>
> That said, I am involved in a revision of the very outdated "TEI  
> Text Encoding in Libaries: Guidelines for Best Encoding Practices" ( http://www.diglib.org/standards/tei.htm 
>  ).  The new version, renamed "Best Practices for TEI in  
> Libraries" ( http://purl.oclc.org/NET/teiinlibraries ) will include  
> MARC field mappings for header elements.  These are currently being  
> drafted, and we plan to have these in place by early July.  This  
> info might save you some time in writing your own stylesheet, noting  
> of course that the TEI encoding in the Best Practices will no doubt  
> differ from your local practice.  We hope this document will become  
> an approved TEI customization, available on the TEI website along  
> with stylesheets for MARCXML-->TEI and TEI-->MARCXML (which someone  
> will need to write), but this is all many months away from happening.
>
> More information on our work and timeline is at
>
> http://wiki.tei-c.org/index.php/TEI_in_Libraries:_Guidelines_for_Best_Practices_Working_Group
>
> If you are interested in writing stylesheets for this, we would very  
> much welcome your help!
>
> Kevin
> Member of the TEI in Libraries: Guidelines for Best Practices  
> Working Group
> Co-Convenor of the SIG on Libraries of the TEI Consortium
>
>
> _______________________________________________
> XML4Lib mailing list
> XML4Lib at webjunction.org
> http://lists.webjunction.org/mailman/listinfo/xml4lib
>





More information about the XML4Lib mailing list