Thursday, August 14, 2008

EAD to DC records

FIELDS STANDARD FROM RECORD TO RECORD

dc:creator ==> Georgia. Superior Court (Troup County)

dc:publisher ==> [Athens, Ga.] : Digital Library of Georgia in association with the Troup County Archives

dc:date ==> 2008

dc:coverage.spatial ==> Troup County (Ga.)

dc:contributor ==> Digital Library of Georgia
==> Troup County Archives

dc:type ==> Judicial records
==> Civil court records

dc:rights

FIELDS PULLING DATA FROM PROCESSING OF EAD INSTANCE

record id: filename of DC record

dc:identifier ==> from first daoloc

dc:title ==> <unittitle>, Troup County, Georgia, <unitdate>

dc:coverage.temporal ==> from <unitdate> in ISO8601b format

dc:subject ==> from subject mappings based on <unittitle>

dc:source ==> Troup County, Georgia Superior Court loose records, Series V: Superior Court Sessions, 1827-1900, <container type="box">, <container type="folder">, Troup County Archives, LaGrange, Georgia.

Wednesday, August 13, 2008

Subject analysis/mapping

Once scanning and mark-up is complete, the Digital Library of Georgia will ingest item-level DC records for each file-unit (i.e., <c0x level="file"> with a <daogrp>). These DC records will include subject analysis. To assign headings to each of these DC files, I'll use the <unittitle> of the file unit and inherited data. The file units being scanned come from the Superior Court record group's series 5, Court Sessions. This series is arranged by court session. Generally speaking, the mark-up falls into the following hierarchy.

<c03>year
<c04>session (e.g, March 1830)
<c05>type of court (either civil, criminal, or court business)
<c06>file unit

My first task is to split the mark-up of series 5 into <c05>s so I can filter the records into categories. Using scripts I'd written to split apart records harvested from OAI servers, I split the mark-up for series 5. Next, I wrote a script to separate each of these files into the folder that reflected the type of court. When creating the DC records, each file unit will receive headings that reflect which type of court the record is from. For example, civil court records will all receive

Georgia. Superior Court (Troup County)
Courts--Georgia--Troup County
Judgments--Georgia--Troup County
Justice, Administration of--Georgia--Troup County

and all criminal ones

Georgia. Superior Court (Troup County)
Criminal courts--Georgia--Troup County
Judgments, Criminal
Criminal law--Cases
Criminal justice, Administration of--Georgia--Troup County

To add the finer grained subject analysis, I depend on the title. So, I again split the records. This time into the file units and rename them based on the first daoloc in the record. (Rename script is also one I regularly use when harvesting OAI records.) Next, I used a script that lists out the unittitles of the file units. For the civil court cases, I got the following "basic" unittitles. Some of the title words are unique, and yet others are used for over 100 records.

  1. Adoption
  2. Affidavit or Affidavits
  3. Alias Declaration
  4. Alias Execution
  5. Alimony
  6. Appeal or Appeals
  7. Arbitration
  8. Assault
  9. Assault & Battery
  10. Assumpsit
  11. Attachment
  12. Bill - SupplementalL
  13. Bill for Account
  14. Bill for Administration
  15. Bill for Construction
  16. Bill for Direction or Bill for Directions
  17. Bill for Injunction
  18. Bill for Lien & Payment
  19. Bill for Ne Exeat
  20. Bill for Performance
  21. Bill for Receiver
  22. Bill for Relief
  23. Bill for Specific Performance
  24. Bill in Equity
  25. Bill of Exceptions
  26. Bill of Sale
  27. Bill to Cancel Deed
  28. Bill to Construct Will
  29. Bill to Marshall Assets
  30. Bill to Reform Deed
  31. CaSa
  32. Caveat
  33. Certiorari
  34. Civil
  35. Claim
  36. Complaint
  37. Complaint for Land
  38. Complaint on Bond, G
  39. Contempt
  40. Contract
  41. Covenant
  42. Cross Bill
  43. Damages
  44. Debt
  45. Debt on Bond
  46. Debtor's Relief
  47. Deceit
  48. Deed
  49. Deeds
  50. Distress Warrant
  51. Distribution
  52. Divorce
  53. Dower
  54. Ejectment
  55. Eviction
  56. Fifa or Fifas
  57. Garnishment
  58. Guardianship
  59. Habeas Corpus
  60. Homestead
  61. Illegality
  62. Interrogatories or Interrogatory
  63. Laborer's Lien
  64. Lease
  65. Legitimation
  66. Levy
  67. Libel
  68. Lien or Liens
  69. Lost Bond
  70. Lost Deed
  71. Lost Note or Lost Notes
  72. Lost Will
  73. Malicious Prosecution
  74. Mandamus
  75. Mechanic's Fifa
  76. Mortgage or Mortgages
  77. Mortgage Cancellation
  78. Motion or Motions
  79. Name Change
  80. New Trial
  81. Partition
  82. Petition
  83. Petition to Sell
  84. Plat
  85. Plea
  86. Possessory Warrant
  87. Power of Attorney
  88. Proceedings Against Tenant
  89. Receivership
  90. Reform Deed
  91. Rent Note
  92. Rules & Orders
  93. Scifa
  94. Slander
  95. Subpoena
  96. Suit on Endorsement
  97. Summons
  98. Support & Maintenance
  99. Tax Receipt
  100. Tenant Holding Over
  101. Testimony
  102. Title Bond
  103. Trespass
  104. Trover
  105. Trusteeship
  106. Waste
  107. Will
  108. Words
  109. Writ of Error

Each will have specific subjects mapped. For example, file units with scifa or scifas as part of the title will have the following additional headings:

Writs--Georgia--Troup County
Scire facias

Finding Aid Progress

Now that we've gotten several years worth of files back from the Troup County Archives and folks at the Digital Library have created DJVu derivative files, I was able to start checking the linking from the <daogrp> tags and to test the display of the finding aids. Given the size of the finding aid, I decide to create separate EAD instances for each record group.

  • Superior Court (which is huge and will probably need to be split further for ease of use)
  • Inferior Court
  • Ordinary Court
  • Justice of Peace Court
  • County Court
To mark up the finding aid, I had used regular expressions in NoteTab light. It went fairly quickly. I've used the RLG Best Practices document as my mark-up bible (http://www.oclc.org/programs/ourwork/past/ead/bpg.pdf). Having the derivative files helped me check to make sure my linking was working properly. The dao hrefs were pulled from info found in the finding aid

recordgroup#series#yearmonthfolder

So for Box 2, Folder 9/1830:6 from the Superior Court Record Group (i.e., record group 1) and Court Sessions series (i.e., series 5), my dao link (also to be my unique identifier for the DC records to be made later) would be

rg1se518300906

One of the issues that I did discover when linking was the issue of multiple folders representing a file unit. I'll need to add additional dao links within daogrp and tweak the stylesheet (a basic one DLG has employed for our other EAD projects, such as the Auburn Avenue Research Library's finding aids: http://dlg.galileo.usg.edu/aafa/) a little to make them display in a way that makes more sense. (Currently the stylesheet calls each daoloc in a group a page.) I'm also thinking of a way to automate adding the multiple daos and how best to split up the behemoth that is the Supreme Court Sessions finding aid. I did pull out some of the extras from the finding aid (the glossary and list of Troup County officials) into separate html files.

My plan is to use a link checker each time we post new files to catch link issues. Once the finding aid is in a mostly finalized form (i.e., all linking is perfect), I'll run it through the date normalizer perl script written by Jason Casden at the Ohio State University, Jerome Lawrence and Robert E. Lee Theatre Research Institute that is available through the tools page on the ead help pages (http://www.archivists.org/saagroups/ead/tools.html) and the RLG report card.