Theses and Dissertation Remediation Project

These procedures outline the process for inventorying, barcoding, and batch cataloging approximately 22,000 Theses, Dissertations, and Plan Bs in the USU University Archives. It includes instructions for doing a shelf inventory, assigning barcodes and call numbers, verifying data, assigning one of three specific batch workflows (Merge (no overlay), Merge (with Overlay), and Original cataloging) and provides step by step instructions for each of the batch workflows.

 


Tools Needed

  • Sierra (local catalog)

  • Airtable                                                                                             

  • OCLC Connexion

  • MarcEdit

  • Excel

  • Barcodes and Barcode Scanner

  • Label Printer

  • RFID Tags


Quick Navigation


Data Collection Process

  1. Export all USU theses records (in any format) from Sierra into an excel sheet [expand on this]

  2. Sort the records into years

  3. Clean and reorder Excel sheet by:

    1. Copy and pasting all records for a year (or decade, if there are sparse records for the year) into a new tab on the spreadsheet, including column labels

    2. Change column order to match Airtable “Next Batch Add” view

    3. Delete the 100 field column

    4. Combine the Birth and Death years into one column, add in the correct punctuation, delete individual columns for years

    5. Isolate the OCLC number

    6. Isolate the titles

      1. Copy the 245 to the very end of the spreadsheet

      2. Split off 245 c by using the text to column, with a delimiter of “/”. Review to make sure it split at the appropriate place. Correct any issues (|c not moved over or “/” was part of the title)

      3. Remove the ending period using the following formula: =IF(RIGHT(A2,1)=".",LEFT(A2,LEN(A2)-1),A2)

      4. Trim the leading/ending spaces with =TRIM([insert cell #])

      5. Scan and review for errors (leave strange text, it will be reviewed and corrected during Procedure 1)

      6. Copy the column and paste back into the original 245 space using paste->value

    7. Split the 502 into Theses Type, Department, Degree Type, and University Name

      1. Theses Type

        1. Use Text to Columns with a delimiter of “(“ to isolate the theses type

        2. Scan and review for errors/issues and consistency with the controlled vocabulary in Airtable

          1. Where found, correct if obvious, or select a default of “Thesis” (this will be reviewed in Procedure 1)

        3. Copy and paste into “Theses Type” column after the Year

      2. Degree Type

        1. Highlight the column with the remaining text (from the previous step)

        2. Use Text to Column with a delimiter of “)” to split out the degree

        3. Review the degrees and correct typos or punctuation errors. Match against the controlled vocabulary in Airtable

        4. Copy and paste into a new “Degree” column after the “Theses Type” column

      3. Department (Note, if the 690 field is consistently applied for the batch, it is a better option for isolating the Department name than the 502.)

        1. Highlight the column with the remaining text (from the previous step)

        2. Delete the double dashes, University name and the phrase “Dept. of “ from the text strings using find and replace the text string

        3. Use Text to Column with a delimiter of “,” to split out the year from the department

        4. Review the department names for spelling errors or extraneous text

        5. Copy and paste into a new “Department column after the the “Degree” column

      4. Add a column after “Department” and input the university name at the time the theses was issued. (See University Name changes for dates.)

      5. Delete all the unneeded work columns at the end of the spreadsheet

    8. Adjust the 300 field

      1. Replace "?eaves" or “ eaves” to "leaves"

      2. Copy field to end of spreadsheet

      3. Use text to columns with a delimiter of “;” to split off the |c. Verify that only the height information was split off, then delete this column (it is automatically applied in Airtable)

      4. Use text to columns with a delimiter of “:” to split off the |b wherever present

      5. Copy the controlled vocabulary for |b from Airtable and copy and paste to each occurrence OR in the next column, use the formula =IF(A2<>"",":$billustrations") and remove all the “FALSE” returns with a find and replace

      6. Split off any extra informaiton (such as maps, CDs) into the Extra Objects column

      7. Copy each into separate column that matches Airtable in order

      8. Ensure each subfield matches the spelling, punctuation, and spaces for the controlled vocabulary in Airtable (if relevant)

    9. Add columns for 500-2, 533, 590-1, 590-2, 690-2, 690-3, and 690-4 immediately after the Illustrations column, copy over relevant info. Note that the following information is NOT copied over into the new fields:

      1. “Library’s copy on microfilm”

      2. USU ETD

      3. Department statements

    10. Note that the columns from 500 on may be out of order if there is an extra note or field. Look at later columns to identify if that information should be moved to the left.

  4. Dedupe records (this can also take place before step 3, if desired).

    1. Sort and resort by different criteria (title, author, etc.) and scan for duplications

      1. Use the conditional formatting to highlight all duplicates in the combined author name

      2. Use the conditional formatting to highlight all the "z" in the material type

    2. If duplicates are found, Preference for a or t records, then b, then z

  5. Once finished, copy into Airtable

  6. Routinely or after each data load into Airtable, run a dedupe process and scan for potential duplicates

  7. Once finished, go to the Sign-Up tab and change the following for the year(s) that were just loaded:

    1. Pre-processing (Barcoding) Status: from “Hold off” to “Unclaimed”

    2. Last Completed Step: from “In Queue” to “Title List Loaded”

Procedure 1: Shelf Inventory

Collection Sign Up and Tracking

Please note that tracking will need to happen in two places because the Theses Project is part of the larger “Barcoding” project and needs to be reported in both the Barcoding Tracking, Issues, and Stats AND the Thesis Data Pull bases.  Please fill out both of these bases, as noted below.

Sign Up For A Collection

  1. Go to the Airtable base “Barcoding Tracking, Issues, and Stats”

    1. Click on the “Sign Up” tab

    2. Collections are organized by year.  Reserve a collection by

      1. Selecting a name in the “Name” column

      2. Changing the Current Status to “In Progress”

      3. Enter a date in the “Date Assigned” column

  2. Go to the Airtable base “Thesis Data Pull”

    1. Click on the Sign up tab.

    2. Collections are organized by year. Sign up for a collection by

      1. Selecting your name in the Pre-processing assigned to field

      2. Fill in the Pre-processing started

Completed Collections

  1. When you are completely finished with a collection year, fill out the following:

    1. Thesis Data Pull

      1. fill out the Pre-Processing End field on the Sign Up tab

      2. Change the Pre-Processing Status to “Completed”

    2. Barcoding Tracking, Issues, and Stats

      1. Fill out the “Date Finished” field in the Sign Up Tab

      2. Change the status to “Completed”

  2. Sign up for another collection year.

Daily Tracking

  1. Pull a cart’s worth of items from the year

  2. Go to the Airtable base “Barcoding Tracking, Issues, and Stats” - > “All Tracking

    1. Add a new line with:

      1. Name

      2. Area

      3. Sign Up ID

      4. First and Last call numbers on the cart

        1. As call numbers are not formally assigned yet – please use the format YYYY-LastName

      5. Date Started

  3. Process items following the steps in “Procedure 1: Shelf Inventory” section

  4. Return to the Airtable base “Barcoding Tracking, Issues, and Stats” - > “All Tracking

    1. Record the following information:

      1. Number of items reviewed

      2. Number of items barcoded

      3. Date Finished 

Statistics are used for tracking the progress of the project and also for reporting to the library administration on current and final numbers. Airtable automatically calculates current projects based on your statistics. It is VERY important to record them as listed above. In general, monthly statistics will be pulled from the Barcoding Tracking, Issues, and Stats Airtable base.

Procedure 1 Process

  1. Open the Thesis Data Pull -> Title List tab -> Master Sorted List view

    1. Note: Modifying existing views may cause problems for others who are using the base. If you would like to filter so that you only see the year you are working on, please go to the YYYY-Template view, then select the “duplicate view” from the three dots next to “Share view”.  Rename with the year appropriate (or whatever unique name is preferred). 

  2. Compare each item on the cart to the list of titles in Airtable

  3. For each item where the theses is found in the collection and also listed in Airtable

    1. Change the status to “Found”

    2. Add the Barcode to the item and record in the “Barcode” column

    3. Verify/update/add the following information using the item in hand. It is very important to match all fields to EXACTLY what is written on the title page and not the front cover of the thesis. There is no need to research or add information that is not available from the item in hand. 

      1. Author name and information (split into Last, First, Expanded, Suffix, and Birth/Death fields in Airtable.  PLEASE NOTE: Only verify that the information is correct as far as is evident from the title page and/or the author bio/vita, if available. If not readily evident, leave information as is). 

        1. For the author Mirla F. W. Morgan, the name would be recorded as:

          1. Author Last

            1. Ex. Morgan

          2. Author First

            1. This is generally first and middle name and/or initials as represented in the object

            2. Ex. Mirla F. W.

          3. Expanded Form of Name

            1. Use only if there was an initial used in the first or middle name and the full version is available.  In these cases record the total first and middle names inside of a parentheses. Add a space before the first parathesis.

            2. Ex. (Mirla Francis Whiting)

          4. Suffix

          5. Birth/Death Date

            1. This will not be commonly available- but might appear in the vita

            2. Record as “comma space birth year dash death year”: , 1929-2012

          6. NOTE: the automations in Airtable will combine this information to look like: Morgan, Mirla F.W. (Mirla Francis Whiting), $d 1929-2012 $e author

      2. Title

      3. Years

      4. Theses Type

      5. Department

        1. If the inventory lists a department but the theses lists no department, cut/paste the department into the “notes” field along with an explanation.

      6. Degree Type

      7. University Name

      8. Pagination

        1. Almost all pagination will be leaves.  Leaves indicate that the print is only on one-side of the page, whereas pages would mean that both sides of the page are printed on.

        2. Record the pagination as noted in the book.  Preliminary pages may be indicated as Roman numeral (v, vii, etc.) Main text of item is likely recorded in traditional Arabic numerals (125 or 67, etc.)  An example of an item with 7 pages of preliminary text that are numbered as Roman numerals and 82 numbered pages in Arabic numerals would look like this:

          1. vii, 82 leaves

        3. Unnumbered paged can be recorded in brackets: [8] leaves

      9. Illustrations

        1. This refers to any illustration at all.  Simply select it from the drop down menu ( it is the only option).  Leave blank if there are not illustrations.

      10. Bibliographic Reference Page

        1. Check that there is a bibliographic reference page (which is likely the case). If so, checkmark the “Bibliographic Reference” field.

  4. For an item that is in the collection, but not listed in Airtable

    1. Use the Search feature in Airtable to see if the item has been listed under a different year. NOTE: You will need to go to the Master Sorted List to conduct this search across all records.  Search for both the title and the author names to be sure it is not listed in another year.

    2. If found, verify the information according to step 3, correcting the year (or any other incorrect information)   

    3. If not found:

      1. Add a new line to the Airtable base and fill out the fields listed in step 3

        1. Exception: for Status, select “Added”

  5. For an item that is in Airtable, but is not found in the collection

    1. Verify that the item is not within the year range in process according to the process in step 4.a (i.e. if it was misfiled later in the year and hasn’t been pulled onto the cart.)

      1. If found, interfile in the appropriate spot and process according to step 3

      2. If not found in collection, select a status of “Not Found”

  6. At the end of every cart, ensure that the order of the material on the cart matches the order of the “Found” titles in Airtable (excluding any items not found). Using filtering in Airtable to limit the view to the year and “Found” status in order to see this list.

  7. Re-shelve and record statistics in “Barcoding Tracking, Issues, and Stats” Airtable base for each cart:

    1. Reviewed (refers to the number of physical volumes)

    2. Barcoded (refers to the number of barcodes placed)

Procedure 1: Known Oddities

This list will grow with time.  Please add to it as needed, to help collect the issues and actions into one space. For all oddities not listed here, please check the “Flag for Becky” column in Airtable and leave any pertinent information in the Notes fields.

Plan B Bound-withs

Explanation: Many of the Plan B’s will likely have more than one title bound into the item.  Key indicators include title pages or table of contents that list a “Report No. 1” (or 2, 3 etc.) and when the page is open to the second or subsequent reports, they show another title page. Another key indicator is drastic changes in page numbers. In later decades, Plan B reports are bound in blue. This is not true for earlier material, however.

Action:

  1. Select the status, select “Boundwith”

    1. This list will be compiled and cataloging entries created by hand

More than one copy or volume of item

Explanation: Sometimes theses were published with more than one volume and/or the library may have collected additional copies of theses/dissertation.  They will be housed next to one another on the shelf, but will not show up more than once in the Airtable list

Action:

  1. Add the barcode and copy/volume number to the appropriate field (1st Multiple) Barcode, (1st Multiple) Copy/Volume or (2nd Multiple) Barcode, (2nd Multiple) Copy/Volume, etc.) Note: Each additional item will require entry in two fields: Barcode and Copy/Volume. 2nd Note: Copy/Volume is set with a drop down for copies 2-5, volumes 2-5, and second copies of volumes 1-5.

  2. FOR MULTI-VOLUMES ONLY, please check the “This is the first volume in a multi-volume set”.  This will be used to generate the first volume number in the call number field

    1. Also change the pagination to “X volumes” where X = the number of volumes. There is no need to record the exact leaves or pagination in this case.


Cover has a different title than the title page

Explanation: Sometimes there are discrepancies between the title page and the cover or spine.  These are typically due to typos.  Regardless of which version is the correct grammar or spelling, the title page is always the main source of information. 

Action:

  1. Verify or record the title from the title page in the “Title” field in Airtable

  2. Record the title from the alternate source (almost always the cover) in the “Alternate Title” field

  3. Select the source for the alternate title in the “Alternate Title Came from” field

More than one entry in Airtable

Explanation: Titles may appear more than once in Airtable because the title list was pulled from all the records currently in Sierra (for physical or microform).  Prior to upload, the titles were de-duped.  But sometimes, the information did not match up one-for-one, so they will appear twice in the Airtable list.

Action:

  1. Preferred records have a material type of “a” or “t” along with a bib record number and an OCLC number. If not present, look for the next best record along these priorities

    1. Material of “a” or “t”. If none, then look for:

      1. Material type of “b” (preference for records with bib number and OCLC number). If none, then look for:

        1. Material type of “z”. If more than one, select the best of the records in Airtable (more accurate information or simply more information).

  2. Select “Found” on the preferred record and proceed with verifying the item in hand as usual

  3. Select “Duplicate” on the non-preferred record.  Please flag both items by checking the “Duplicate – Needs Review” field.

Thesis is from a different University

Explanation: Theses submitted to the archives from other universities may have been interfiled with USU theses.

Action:

  1. Verify that the thesis was not produced at USU (under the current or previous names of the institution) by examining the title page and if needed, additional pages such as the abstract or introduction.

  2. Mark the Status field in Airtable as “Other University”

    1. Note: Selecting this status will automatically create a new entry on the “Non-USU Theses” tab

  3. Put the thesis in the “Other University” shelf next to the barcoding table near the back stairwell in the stacks

  4. If the thesis has already been assigned a number, leave a comment on the record (tagging the Student Supervisor) so that the number can be reassigned

Thesis has more than one author

Explanation: On rare occasions, theses will list more than one author. Sometimes when this happens, there are two theses with the same title and content, but both authors are listed and are in opposite order on each thesis.

Action:

  1. Set the status to “Found”

  2. Check the “Flag for Becky” column

  3. Leave a note in the Notes column that explains why the item is flagged for Becky

  4. When the year is being numbered, be sure to include this item in the numbering, using the first author last name as the filing order.

 

 

Statistics for Procedure 1

Please refer to the Collection Sign Up and Tracking portion of these instructions for the full information on how to sign out carts and also track the work done. The final statistics that will be pulled for reporting purposes (not tracking) are as follows:

  1. In Airtable base “Barcoding Tracking, Issues, and Stats” - > “All Tracking

    1. Number of items reviewed

    2. Number of items barcoded

Procedure 1a: Quality Control for Shelf Inventory

  1. Sign up for a year in the “Student QC” column in the “Sign Up” tab in the Theses Data Pull Airtable base

    1. Where possible, arrange to work with another student tech on a year

  2. Check order of authors on the shelf

    1. Grouped by year, then alphabetized by Last Name, First Name.

  3. Check spelling and punctuation in “Title” field against the physical book

    1. Remove any strange characters and type out with your keyboard.

      1. For example, if your thesis has a title with a “β”, then the transfer process probably put funky symbols in the title field. Delete the unnecessary symbols, and write out “beta”.

    2. Make sure the only things that are capitalized are 1) The first letter of the title, 2) Any proper names, and 3) The first letter of any Latin names (ex: Clostridium botulinum).

  4. Compare the year in the thesis to the year on Airtable

  5. Check Theses Type

  6. Double check Department

    1. We’re going with whatever is in the book – if it says they received a M.S. in Education, use Education as the Department. If they specifically state the Department, use that. If you can’t find the Department, make sure the one listed in Airtable sounds reasonable for the content of the thesis, and then leave as-is.

  7. Check Degree Type

  8. Check University Name

  9. Under “Pagination”, make sure it says “leaves” at the end. Leaves are one-sided, pages are front and back. Most, if not all, of the theses should be leaves.

  10. Double check the multiples are recorded and that the section that says “(1st Multiple) Copies” has a selection underneath it (usually c.2), as well as a barcode in the “(1st Multiple) BARCODE” field.

  11. Check Status

    1. If there are two titles bound in the same book (most commonly found with Plan B Reports) then your Status should be marked “Boundwith” – this is going to be in the main thing you’re looking for. Some indicators that you might have Boundwith, besides the obvious point of having two titles, is pagination that starts over in the middle of the book.

  12. If you have to add a thesis to your year because it’s found its way back to the shelf after the barcoding process:

    1. Search Master Sorted List to avoid duplicate records.

      1. If found, change the status to “Found” and verify the information according to the instructions above

      2. If not found, add according to the instructions above

        1. Once added, comment on the year under the “Sign Up” tab and tag either @Seth Westenburg or @Bryn Larsen. This is an important step for all material prior to 1975 because the entire year will need to be re-numbered.

  13. When the year is finished, return to the Sign Up Tab and check the “Student QC Complete” field for the year

Statistics for Procedure 1a

Because statistics for the initial barcoding have already been recorded in the “Barcoding Tracking, Issues, and Stats” Airtable base.   Only record statistics in the main “Statistics” Airtable base.

  1. For each item reviewed, count the total in the “Batch Record Prep” column

  2. If during the QC process, missing information is applied, count each record once in the “Batch Record Prep” except the following

    1. Newly barcoded items = Record in the “Barcoding Tracking, Issues, and Stats” Airtable base in the “Barcoded” column

Procedure 1b: Assigning Call Numbers

  1. Refer to the “Shelf List/Call Number Assignment” view in the “Title List” in Airtable

  2. Limit view to one year at a time

  3. Print off the list (go to the ellipses …. And select the “print view”) or pull up on mobile device

  4. Bring list to Theses shelves

  5. Compare Airtable title order to shelf order and make sure they align

    1. They should be sorted in this order:

      1. Year

      2. Alphabetically by Author Last Name

      3. Alphabetically by Author First Name

      4. Alphabetically by Title

  6. Starting with the first item, assign a number to each item starting with 1.  Only assign an integer and no other characters, for example:  1, 2, 3, 4, etc.

    1. With the exception of copies and multi-volumes

  7. Record the number in Airtable (or on sheet and then transfer to Airtable).

  8. Verify all multi-copies are listed correctly

Procedure 1c: Print Labels

This process can be done immediately after assigning call numbers or it can be done at the end of Procedure 4’s batch processing routines. Printing labels can be done individually using Sierra (after the batch processing), individually using a PDF, or in batches. Below is listed the methods for printing individually via a PDF and in batches using mail merge.

Print Labels Using PDF

Print labels for each year.  You can create them by hand by inserting the appropriate year and number into this label template: https://usu.box.com/s/wn0dhynskjnjb1rvu3fkgib6auf91tph and then saving as a PDF before sending to the label printer.   

Batch Printing Labels

For this part of the process, you’ll be creating the data with which we will print the call # labels.

Create an excel file

  1. Open Excel file and title it “Mail Merge Template”.

  2. Create two columns: one for “Year” and a column for “No”.

  3. Input the year you are processing under “Year”.

  4. From the total shown in the number of rows in Airtable, create as many lines in Excel.

    1. Keep in mind copy 2s and multiple volumes, which will require creating labels by hand (see above).

  5. To quickly and simply create the entries you need:

    1.  Input the year into two lines, and then under number input just the “1” and “2”.

    2. Select the years and number cells.

    3. Click and hold on the bottom right square of the selection.

    4. While continuing to hold the mouse button, drag downward. If done correctly you will see a grey box by the mouse cursor showing a number. This number increases as you drag further and represents the current number created.

  6. “Save as” the Excel template with a unique name to the year (no need for strict naming rules, just saving it as something else).

 

Mail Merge (in Word)

Through Mail Merge, you’ll create a document that will automate the call # label printing process. The main purpose of using “Mail Merge” is for creating a bunch of address labels. Since you aren’t sending out wedding invites, the terminology might not make much sense but it’ll still do what we need.

  1. Open the “Thesis Mail Merge Template” Word document: https://usu.box.com/s/wn0dhynskjnjb1rvu3fkgib6auf91tph

  2. The “USU Spec Coll” section is a header created in Word. You do not (and should not) need to make any edits here.

  3. There should be an empty line between “Theses” and “no.”.

  4. There should be no space after “no.”.

  5. To begin the Mail Merge Process, click on “Mailings” in the top tool bar.

  6. Click on “Select Recipients”.

  7. Click on “Use Existing List”.

  8. Select the Excel file you created and saved and click “Open”.

  9. For the next dialogue box, you should be able to just hit “Ok”, but ensure that the checkbox is selected for “contains headers”. This will omit the top line in the Excel file that denotes the columns for “Year” and “No.”.

  10. At this point, your document will be prepped for pulling information from the Excel file.

  11. While still under the “Mailings” category in Word, find the “Insert Merge Field” button. Click on it, and notice that it gives you options for “Year” and “No”. You’ll use this to pull in information from the Excel file.

  12. Click on the empty line under “Theses”. The line should contain no spaces.

  13. Click on “Insert Merge Field.”

  14. Click on “Year”.

  15. You should now see <<Year>>.

  16. Click directly after the “no.”. Again, there should be no space after “no.”.

  17. Click on “Insert Merge Field”. This time, select “No”. You should see <<No>>.

  18. Through this, Mail Merge will create a page for each line in the Excel file, inserting the year and number listed. It doesn’t necessarily need to be sequential: you could (and likely later will need to) create a custom Excel file that lists any further labels you might need, such as copy 2s.

    1. For copy 2s or volume whatevers: the Word template should have space under the “no.” line where you could enter “c.2”. Make sure there are no spaces after “c.2” and that Word doesn’t create another blank page. (If it does just click at the top of the second page and hit backspace until it goes away.)

  19. For the last steps, click on “Finish & Merge”.

  20. Click “Print Documents”.

  21. In the next dialogue box, “All” should be selected. Click “ok”.

  22. For the printer, select “Adobe PDF” and select “Ok”.

  23. You will now save the document under whatever name works best for you.

  24. The file should open automatically. If not, find it and open it.

  25. Did you notice that “Merge to Adobe PDF” button? Yeah you’d think that would be a shortcut but it creates a PDF file for EACH label, which I discovered after testing. Now if you’ll excuse me I have to clean up a bunch of PDF files from my desktop.

  26. Print the document to the label printer.

    1. Before you print, you may want to make sure there are enough labels in the printer. It will resume printing upon replacing the roll if it runs out, so don’t worry if you forget to check.

  27. Do not save the Word Template! It keeps some residual info that might mess something up later. Haven’t played around with it enough to be sure, but just close the Word document without saving.

 

 

Procedure 1d: Attach RFID tags

 

Procedure 1e: Review Numbering and Attach Labels (use only when directed)

  1. Sign up for a year in the "Renumbering View" in SignUp tab in Airtable by adding your name to the 'Counting Assigned to" column

  2. Open the view "Verifying Call Numbers" in the Title List

  3. Limit view to the year signed up for

  4. Print off the list (go to the ellipses …. And select the “print view”) or pull up on mobile device

  5. Bring list to Theses shelves

  6. Compare Airtable title order to shelf order and make sure they align

    1. They should be sorted in this order:

      1. Year

      2. Alphabetically by Author Last Name

      3. Alphabetically by Author First Name

      4. Alphabetically by Title

  7. Verify all multi-copies are listed correctly

  8. If all looks well, follow Procedure 1c: Print Labels. Note that for now, it is best to use the "Print Labels Using PDF" instructions - which prints labels individually. Batch processing needs to be updated for multi-volume or multiple copies.

    1. If there are problems, notify they should notify the Student Technician Supervisor (Seth) or the Head of Cataloging (Liz)

  9. Once finished, go back to the "Renumbering View" in SignUp tab in Airtable and change the status to Numbered and Labeled


Procedure 2: Verifying Sierra and OCLC Records 

  1. Sign up for a year (or years) on the Sign Up tab under the view “Sierra Comparison Signup”. Find any year with a current status of “Labeled”.

    1. Add your name to the Sierra Comparison Sign Up column

    2. Change the Sierra Comparison Status to “In Progress”

  2. Using the “Comparison to Sierra” view in the “Title List” tab

    1. Look up each item BY THE TITLE in your assigned years in Sierra.  If present verify the Material Type:

    2. For Material Type of "a” (book) or “t” (manuscript)

      1. Add an item record to Sierra, using info from Airtable:

        1. Barcode

        2. Call Number (change 090 to 099)

        3. Location: ithes

        4. Item Type:  57 (theses or dissertations) or 56 (Plan B report)

          1. Note: if an item record already exists with the call number starting with “INQUIRE AT SPECIAL COLLECTIONS DESK”, delete this item record and create a new one with the information above. DO NOT DELETE any other item records (including items in the stacks, other areas of SCA, or microfilm.)

      2. Verify the Sierra bib number is the same in Airtable, replace with correct number if needed

      3. Look up item in OCLC. 

        1. First use the TITLE and/or AUTHOR, then use the OCLC number, if needed (add “#” or “no:” to search OCLC by number

        2. Check each print or manuscript record for the best match (title, author, and year).

        3. Verify it is a print record (FORM : [blank] or r) 

      4. In the “Batch Type” column in Airtable, select “Item Added – Overlay batch change”

    3. For Material Type of “b” (microform) or “z” (online, electronic)

      1. Search Sierra by title to confirm that there are no other appropriate records with “a” or “t” material types.

        1. If found, replace the Sierra bib number in Airtable with bib number for the record to be overlayed.

        2. Update the Sierra Material Type.

        3. Follow the steps for 2.b listed above (For Material Type of "a” (book) or “t” (manuscript))

      2. If no other other records are found, proceed to searching OCLC

      3. First use the TITLE and/or AUTHOR, then use the OCLC number, if needed (add “#” or “no:” to search OCLC by number

      4. Check each print or manuscript record for the best match (title, author, and year).

      5. Verify it is a print record (FORM : [blank] or r) 

        1. If the OCLC record is print AND the Sierra record is a “b” material type

          1. Select “Merge” as the “Batch type” in Airtable

          2. Verify that the OCLC record # matches the # in Airtable, replace with correct number if needed

        2. If the OCLC record is print AND the Sierra record is a “z” material type

          1. Select “Merge OCLC, No Sierra” as the “Batch type” in Airtable

          2. Verify that the OCLC record # matches the # in Airtable, replace with correct number if needed

        3. If there is no OCLC record for a print item, do the following to the Airtable record:

          1. Select “Original” as the Batch type

          2. Delete the OCLC # from its column

    4. For items “Added” to Airtable

      1. Search for item in Sierra by searching for title,

        1. If found, check the “Flag for Becky” column and leave a note in the “Notes” column

        2. If not found, proceed to next step

      2. Search for item in OCLC

        1. If a print record is found (has OCLC Fixed Fields Form: [blank] or r), add the following to Airtable

          1. Add the OCLC #

          2. Select “Merge” as the “Batch type” in Airtable

        2. If no print record is OCLC and no record in Sierra, record the following in Airtable:

          1. Select “Original” as the Batch type

  3. When finished, return to the “Signup” tab and update the following

    1. Last Completed Step - change to “Sierra Comparison”

    2. Sierra Comparison Status - change to Completed

Procedure 2: Known Oddities

This list will grow with time.  Please add to it as needed, to help collect the issues and actions into one space. For all oddities not listed here, please check the “Flag for Becky” column in Airtable and leave any pertinent information in the Notes fields.

Year is different between Airtable and OCLC or Sierra

Explanation: If there is a discrepancy in the theses/dissertation year listed in Airtable and the year listed in Sierra or OCLC, then the master records will need to be adjusted by a cataloger.

Action:

  • Check the “Flag for Becky” column in Airtable and leave a note in the “Notes” column explaining the issue (for example: “The year is listed differently in Sierra from Airtable”

OCLC returns no records when it is obvious one should exist

Explanation: Likely caused by beginner articles (a, the, an) or the use of Boolean terms in the title (AND, OR, NOT)

Action:

  • Strip out articles and Boolean terms or simplify to just the keywords in the title

 


Procedure 3 - Final Quality Control

This process will review the information recorded in each year and the batch type assigned to the item prior to Batch Record Creation process.

  1. Sign up for a year on the “Sign Up” tab in Airtable, including recording the name of the quality controller, the status (changed to “In Progress”), and the date the review started.

  2. In the “Title List” tab, use the “Batch Check” view and filter for the year being checked.

  3. Look for pastel colored lines to indicate where information is missing. 

    1. If found, use the commenting feature in each record to request a check and verification of the missing information

  4. Review the Batch Type and ensure the following parameters

    1. Merge - verify that there is both an OCLC # and a bib #, material should be set to be “b

      1. If the material type is “a” or “t” – double check Sierra for a print version bib that might have been missed in Procedure 2

      2. If the material type is empty or OCLC or Sierra is empty – double check Sierra to check for a print version bib

        1. If you do find a print version bib for either (not microform or electronic) – make a new Item record using Procedure 2, step 2-b

        2. Delete any “INQUIRE AT SPECIAL COLLECTIONS DESK” item records

        3. Change the Batch Type to “Item Added” and add the Sierra/OCLC numbers, plus “a” or “t”

    2. Item Added - verify that there is both an OCLC # and a bib #, material type should be an “a” or “t

    3. Original - verify that there is no OCLC #, bib #, or material type listed. (Note: the only exception is for a “z” material type, in which case there may also be an OCLC # or bib #, but these can be ignored & often are for electronic versions.)

    4. Merge OCLC, no Sierra - verify that there is an OCLC # and that there is no Sierra record (Note: These are often incorrectly labelled as “Merge” and can be spotted when “Merge” records have a “z” Material type record with both an OCLC# and a Sierra bib #. )

    5. Catalog Individually - not a batch catalog prospect

  5. Review author’s name fields, in particular:

    1. Fix errors caused by diacritics

    2. Add “$q” before all expanded forms and enclose the terms in parentheses:

      1. Expanded Form of Name: $q(Sidney Walker)

    3. Add “, “ [comma and a space] before all suffixes [See OCLC’s Bibformats page about personal names for more info]

      1. Make sure the suffix is not repeated in the expanded form:

        • Expanded: Sidney Walker, jr. <--take out duplicate suffix

        • Suffix: Jr.

        • rare exception:

          • Author first: Laurence J.

          • Expanded: Laurence Junior

    4. Add periods to initials, if missing

  6. Review the Title field, in particular:

    1. Fix errors caused by diacritics

    2. Remove ending periods

    3. Remove any subfield c information

    4. Ensure a subfield b is placed before any remainder of title (usually after a colon)

    5. Fix obvious spelling errors or typos. Be sure not to change scientific names or personal names.

  7. Assign the Theses Type (Standardized) based on the Theses Type field

    1. Thesis = Theses

    2. Plan B Report = Plan B Report

    3. Dissertation = Dissertation

    4. Any other type in the Theses Type field = Book Noncirculating

  8. Verify that the Department name does not have any spelling or punctuation errors

  9. Verify that the Degree type does not have a spacing, punctuation, or capitalization errors (Note: This may be done batch changes periodically.)

  10. Verify that the University Name is correct and correctly assigned for the time frame. (See University Name changes for dates.) If the latter appears incorrect, use the comment feature to request a check on the item. The information found on the thesis will be the final information recorded in the Airtable record.

  11. Review the pagination and make sure they are listed as “leaves”, check for egregious errors

  12. Verify that there is a barcode

    1. Check the multiple copies/volumes for the number of additional items that might belong to the title (Note: These are also flagged in purple.) Ensure that a barcode and copy/volume number is provided for each additional item.

  13. Verify that a number has been assigned to the item

  14. For the entire year, check the Numbering to ensure that there are sequential (no skipped numbers) and match the “Numbered” column in the Sign Up table.

    1. If there is a mismatch, check the “All Items Sorted by Date and Author” view in the Title. Missing numbers may be filtered out of your view because they have been flagged for review in a previous process

  15. Replace all double quotation marks (“) with a single quotation (') [note this is to make the find and replace in the batch processing simpler]

  16. For all “Item Added - Overlay batch changes”, check the item in Sierra to make sure the added item number matches the thesis call number in Airtable.

  17. For items that meet all of the above criteria, check the “Ready for Batch Cataloging” column

    1. Please skip any items listed as “Catalog Individually” in the Batch Type

  18. Record the status of the collection (year) on the Sign Up page and date when the review is completed.

    1. Complete = all items have been approved and no items are waiting to be reviewed by the cataloger

    2. Reviewed, some items remain = the year has been thoroughly reviewed and all applicable items approved, but there are remaining items in the year that need to be addressed. Record which items they are and needs to happen with them. 

    3. Hold off = there is a problem with all records in this year. These items will need to be revisited at a later date.


Procedure 4 - Batch Record Creation and Importing

This process will outline how to take the data from Airtable and merge with or create original records for OCLC. There are three processes used depending on the needs of the record:

  • Merge: Used for records where an existing OCLC record can be merged with the updated and verified Airtable table. Records are brought into Sierra as new records.

  • Overlay: Used for records where an existing OCLC record can be merged with the updated and verified Airtable table. Records are overlayed on top of existing Sierra records.

  • Original: Used to create new records where none exist in OCLC or in Sierra.

Please note that a single year may include all three workflows. Once all items in a year are accounted for, change the “Last Completed Step” column in the Thesis Remediation Airtable to the value ‘Complete’. Otherwise, change the value in this column to the last completed batch process.


Thesis Batch Merge Processes (with and without overlay)

General Merge Workflow Overview

MarcEdit Brief record creation for merging: export from AirTable → Open in Excel spreadsheet and edit → useMarcEdit Tab Delimited Convertor to make brief MARC records (.mrk) → check/edit MarcEdit Brief Records → save for merging.

OCLC records batch-search and edit: change (OCoLC) prefix to * → copy entire OCLC# column → save in notepad → load notepad txt file in OCLC batch processor → point to empty OCLC local save file → run OCLC Batch Process search → export all searched records → convert OCLC file to MarcEdit mnemonic (.mrk) → delete certain fields (such as the 502's) → save for merging.

Merge brief & OCLC batchsearched records together: batch edit → compile into MARC (.mrc) format → load into Sierra → make Create List of LOAD → global edit Item call numbers to 099 → DONE!

 

Changes needed for the two Batch Merge Processes

  1. Brief record will have:

    1. OCLC # - match point

    2. Title – match point

    3. 099

    4. 100 – cleaned up author

    5. 245 – cleaned up title

    6. 264 – correct year

    7. 300 – concatenated from the pagination column and the illustration column

    8. 500 – DEPARTMENT

    9. 500 – Transcribed university name (where different from the official name).

    10. 502 – statement: DEGREE $cGRANTING INSTITUTION NAME AT TIME OF AWARD $dYEAR

    11. 520 – Summary scraped from digital theses.

    12. 690s - Local Subject String

    13. 907 Sierra bib-number matchpoint (needed only for merge & OVERLAY)

    14. 949 info (barcode and call # and location code and item type code* )

  2. Batch Changes to OCLC record will be:

    1. Delete existing 100, 245, 264, 300, and 502 fields (merge afterwards)

    2. Add 33x fields if missing

    3. Change 504 to “Includes bibliographical references.”

    4. Add in 655_7 Academic theses |2 lcgft

    5. Add 590 “Thesis Remediation Project”

    6. Change 049 from UUSA to UUSP

    7. Add a 710 for Utah State University

    8. Add a 909 for CMS stats

 

I. DETAILED PROCEDURE FOR MERGE (with no overlay) workflow

  • MarcEdit Templates to download: under revision

 

  • Thesis-MERGEnoover-TEMPLATE3 is used during the conversion of the spreadsheet to MARC.

  • ThesisMERGEnoover-MergeSettings2 is used during brief & OCLC bib record merging.

 

  1. Spreadsheet Manipulations:

    1. Go to AirTable Thesis Data Pull

    2. Click on the Sign Up tab, Batch Loading view; and sign up for a theses year

      1. Batch Loader: you

      2. Batch Load Status: In Progress

      3. Fill in Batch Load Start Date and (if needed during the process) Notes.

    3. Go to Title List and the view: Merge Batch Work

      1. Check mark records in the column Batch Cataloging in Process

    4. Check all essential fields are filled:

      1. Flag any problems that need to returned for more work, checking etc. (incomplete, problem, etc.)

      2. Download CSV file of items by clicking on the down arrow at the top menu, in one of two ways:

        • First filter by your chosen year: Thesis Contains [your chosen year], before downloading;

        • Or, if titles are already filtered by your year, go ahead and download them

        •  

      3. Add the year of the download to the CSV name while downloading to the Desktop, like so: Title list-Melanie - Merge Batch Work - 1931

    5. Open Excel; Open other workbooks; on Computer; Desktop; choose All Files; navigate to the CSV file (under title: Title list-Melanie/Joe - Merge Batch Work; choose Delimited; Next; click in Comma box, Next, and Finish.

    6. If you haven’t done so already by filtering to your year in Airtable, delete all but your chosen year from the spreadsheet.

    7. Double quotation marks

      1. To separate and protect actual 500 quoted notes from CSV escape symbols; click on the corner of the spreadsheet to select the entire file;

        1. go to Find and Select; then Replace;

        2. Find what: " (double quote) – if there are no double quotes, go on to next step.

        3. Replace with: ^ (inverted caret); Save.

    8. Column work:

      1. Delete: Thesis column with text in the format of year-number-Author in the cells)

      2. Delete: Batch Cataloging in Process

      3. Delete: OCLC #

      4. Keep: OCLC w Prefix

      5. Keep all MARC tag columns (note: Excel will have unhelpfully changed 099 to 99; keep it anyway).

      6. Delete: Batch Type

      7. Delete: Date loaded

    9. These columns must remain (even if empty); because we will be using a MarcEdit template: again, do not delete any of these:

      1. OCLC w Prefix - in this format, (OCoLC)####

      2. 99

      3. 100

      4. 245

      5. 264

      6. 300

      7. 500

      8. 500-2

      9. 500-3

      10. 502

      11. 520

      12. 590-1

      13. 590-2

      14. 690-1

      15. 690-1a

      16. 690-2

      17. 690-3

      18. 690-4

      19. 949

      20. first Multiple949

      21. second Multiple949

      22. third Multiple949

    10. Save As the Excel Merge file as Text (Tab Delimited); under a new name (such as: Theses-MergeNoover-year)

 

  • Convert to a file of MARC records using MarcEdit with our AirTable data:

    1. In MarcEdit; Tools; Delimited Text Translator;

      • Source File: Theses-MergeNoover-year

      • Output File: [new name such as Theses-MergeNooverBrief-year]

    2. Click Import File (formerly Next)

      1. Then: click on Load Template (may need to scroll down to the bottom of the window) navigate to Thesis-MERGEnooverMARC-TEMPLATE3 (downloadable template is available at the beginning of this procedure , if needed)

      2. Fields will load automatically in the Arguments window.

      3. Check the Template has loaded the correct mappings in the Arguments window (scrolling with the mouse or arrow keys)

        1. Tags and first subfield:

          1. 035$a           

          2. 099$a           

          3. 100$a                         

          4. 245$a           

          5. 264$c            

          6. 300$a           

          7. 500$a

          8. 500$a   

          9. 500$a       

          10. 502$b 

          11. 520$a       

          12. 590$a

          13. 590$a

          14. 690$a

          15. 690$a

          16. 690$a

          17. 690$a

          18. 690$a

          19. 949$a

          20. 949$a 

          21. 949$a

          22. 949$a

          23. Click Finish.

            •  

    3. Edit the Brief Records in MarcEdit:

      1. Open MarcEditor,

      2. Click on File; Open; then navigate to the new file.

      3. You can doublecheck the mappings by looking over the first mock record in the file to see that the tags and mapping match:

    4. Use Reports; Field Count; and check that 100, 245, 300, etc. match number of records (ignore 500’s, 590’s, etc., where more than one field will exist per record)

      1. (However: that first mock record will also be counted; so subtract 1 when looking for the correct number of records)

      2.  

    5. Edit the Brief record (.mrk) file.

      1. Use Edit; Find; Replace to get rid of CSV double-quotes (used to protect commas in the 100’s, etc.).

      2. Find: " (one double quote)

      3. Replace: (with nothing)

        •  

      4. Replace all to clean up the entire file.

      5. Then replace any 500 quoted notes, if any (which would be protected earlier by replacing their quotation marks with carets):

      6. Find: ^ (inverted caret)

      7. Replace: " (double quote)

      8. Replace all.

      9. Like so:

        • =500 \$aInstitution name listed as ^Utah State Agricultural College^ on the title page.

        • =500 \$aInstitution name listed as "Utah State Agricultural College" on the title page.

      10. Then File; Save (it’s a good idea to do a save after each major edit).

    6. Check the subfield d’s in the 100 fields which often have problems:

      1. Edit; Find: =100

        •  

           

      2. To fix them, do an Edit; Find; Replace

        • Find what: $d$e ← which won’t occur anywhere else in the records.

        • Replace with: ,$e comma,$e

        •  

    7. If there are any authors with birthdates in the 100s that are missing a d-subfield, do an Edit Field Data to add them:

      1. Tools; Edit Field Data:

      2. Field: 100

      3. Find: 1 (type a single space, followed by a 1)

      4. Replace: $d1 (which will take out the space, add the subfield d & reinsert the 1)

      5. then if there are incorrect commas at the end of a year [dash], do another Edit Field Data:

      6. Field: 100

      7. Find: -,$e (a dash, comma, $e)

      8. Replace: -$e (removes the comma)

      9. Example of (most probably) exhaustive name with subfields: Smith, Sam C.$q(Sam Christian),$cJr.,$d1985-2045,$eauthor.

    8. Do a Edit; Find; Find All on =245 for each article (as below) to check for titles with articles that are missing a skip indicator. MarcEdit usually catches all the A, An, and The articles when converting to MARC, but it misses one or two on occasion.

      • =245 10$aA ← add a single [space] after each to avoid title words starting with 'A'

      • =245 10$aAn ← add a single [space] after each…

      • =245 10$aThe ← add a single [space] after each…

      • Replace any found with the correct skip number (remembering to add the following space):

        • Find: =245 10$aA ← add a single [space]

        • Replace: =245 12$aA ← add a single [space]

        • Replace All: (and make sure the number of modifications matches the number of titles found with the issue)

          •  

    9. Then add our local constant data:

      1. Tools; Add/Delete Field; copy/paste the values below in blue; then Add Field:

      2. Add 590, field text: \\$aThesis Remediation Project.

      3. Add 655 field text: \7$aAcademic theses.$2lcgft

      4. Add 710 field text: 2\$aUtah State University.

      5. (Note that each field will also be added to the first mock record).

    10. ***Check for any double spaces that may have been mistakenly entered in previous steps, e.g. “50 leaves _ _ :$illustrations"***

      1. If these truly don’t belong in the record, use Find & Replace to reduce to one space.

    11. Finally, use Edit; Find on important fields to do quick check for any more egregious errors (but don’t get caught in the weeds).

    12. Remember to Save the file of cleaned up brief records for the Merging step.

  • Batch-search Connexion with OCLC numbers:

    1. In Excel and Notepad, create a text file of OCLC numbers for the batch search in OCLC Connexion.

      1. Open Excel and go back to the spreadsheet: Thesis-MERGEnoover-year

      2. Copy/insert OCLC w Prefix into a new column; Replace (OCoLC) with * (asterix).

      3. Select all the OCLC numbers; right-click; choose format cells; & change Category to text.

      4. Copy and paste the cells in the OCLCbatchsearch column to Notepad; save as text; with an easy to find title, in this format: OCLCnos-year

      5. Note the number of *OCLC rows (records).

    2. In Connexion, go to File; Local File Manager; choose an empty Local Save File path (or make a new one).

      1. Go to Cataloging; Search; Local Save File; then hit OK, leaving the search box blank, to make sure there are no records lingering in there;

      2. Delete any lingering records.

    3. Go to Batch; Enter Bibliographic Search Keys; Import.

      1. Navigate to your text file of *OCLC numbers, make sure the Default Index is: None, and click Open.

      2. I always click Don’t Delete, when asked (to allow for error checking and redoing).

      3. The text file should load into the Bibliographic Batch Search Window; click Save; then close window.

    4. In File; Local File Manager; Set up Authorizations; add Authorization & Password; Save & Login.            

    5. Go to Batch; Process Batch; make sure the empty Local Save File is still selected (if not, select it again).

      1. Only Online Searches should be selected; then click OK.

      2. The OCLC numbers should scroll down the view window, until all have been searched.

      3. Finally, a Batch Search Report will appear; go down to the last save number in the report; it should equal the number of *OCLC row numbers that you wrote down earlier.

    6. The Local Save File should have received all the records.

      1. Go to Cataloging; Search; Local Save File; then hit OK, leaving the search box blank.

      2. Select every record in the batch save file (select first record; hit Ctrl and Shift together, at the same time clicking on the last record).

      3. Action; Export; name the export file, something like this: THESES-OCLCbatchsearch-year.

    7. Open MarcEdit; click on MARC Tools icon; leave Select Operation as MarcBreaker;

      1. Open (under All Files): THESES-OCLCbatchsearch-year

      2. Save As: Theses-OCLCbatchsearch-year

      3. Leave Default Character Encoding as MARC8

      4. Click Execute; then click on Edit Records:

        •  

    8. Edit; Find/Replace: =049 \\$aUUSA to =049 \\$aUUSP; Replace All.

    9. Delete these fields which we will be replacing with our updated versions:

      1. Edit Records and Tools; Add/Delete Fields, fill in the tag in Field; and leave Field Data blank; then click Delete Field:

        1. Field: 100

        2. Field: 245

        3. Field: 260

        4. Field: 264

        5. Field: 300

        6. Field: 502

        7. Field: 520

      2. For the following Fields, we will be deleting certain duplicate versions, using Add/Delete Fields:

        1. Field: 655 Field Data: \7$aAcademic theses.$2lcgft

        2. Field: 655 Field Data: \4$aAcademic theses.

        3. check for other versions of 655 Academic theses headings to delete, but don’t remove FAST headings:

          • =655 \7$aAcademic theses.$2fast$0(OCoLC)fst01726453 keep fast 655 fields

          • Delete:

            • =655 \2$aAcademic Dissertation$0(DNLM)D019478

            • =655 \7$aTh{grave}eses et {acute}ecrits acad{acute}emiques.$2rvmgf$0(CaQQLa)RVMGF-000001173

            • =655 \7$adissertations.$2aat$0(CStmoGRI)aatgf300028029

            • =655 \7$adoctoral dissertations.$2aat$0(CStmoGRI)aatgf300312076

            • =655 \7$amasters theses.$2aat$0(CStmoGRI)aatgf300077723

            • =655 \7$atheses.$2aat$0(CStmoGRI)aatgf300028028

    10. Find; Find All =504 to see that all them have the form: Includes bibliographical references;

      • Find: =504 .* [Must select “Use Regular Expressions” option]

      • Replace: =504 \\$aIncludes bibliographical references.

    11. Search for any questionable 008 fields that don’t have an s date type.

      1. Find all: =008 [0-9]{6}[^s] [Must select “Use Regular Expressions” option]

      2. These will have a “q” and two dates in the 008 instead of just one.

        1. Jump to Record:

          •  

        2. The field in question will be highlighted, so do an Edit; Replace

        3. Copy/paste and very carefully change only the effected portions of the 008: qyearyear with syear\\\\

          1. An example:

            • Find: q19291957 (the original cataloger was unsure of the year, and put in a time span)

            • Replace: s1923\\\\ (but we know the year was 1923)

        4. Run Reports; MARCValidator to make sure no blanks “\” or digits haven’t been lost in the 008 fields.

          • Note that MARCValidator may mark the lack of 245 fields as an error, but those will be replaced in the next (Merging) step.

        5. Save edited OCLC file for Merging step (and Close).

  • Merge Brief file and OCLC batch-searched file records together:

    1. Go to Tools, MARC Processing Tools, Merge Records, then navigate to the two files, in this general form:

      1. Source file: Theses-OCLCbatchseach-year.mrk    → (OCLC batch-searched/edited records)

      2. Merge File: Theses-MERGENooverBrief-year.mrk(brief records with our data)

      3. Save file: Theses-MERGED-noover-year.mrk          → (future file of merged records)

      4. Record identifier:  035$a → (Important Matchpoint)

    2. Click Next:

      1. Use the saved Merge Settings template: click on Settings; Load Settings;

      2. Then navigate to: Thesis-MERGEnoover-MergeSettings2

      3. If the Settings file isn’t working; highlight the fields below, and click on the arrow to select them for merging into the OCLC file of records.

    3. Do quick check of the loaded Settings:

      • 049 UUSP

      • 099                 local call number

      • 100 Author (our corrected version)

      • 245 Title (ours)

      • 264 RDA production date (ours)

      • 300 Physical description (ours)

      • 500                 Department & transcribed note

      • 502                 RDA standardized thesis statement

      • 520 Scraped summary field

      • 590 Project name

      • 655                 Academic theses genre term

      • 690                 local subject string

      • 710                 university

      • 949                 automatic item record creation

    4. Then click Next to do the merge.

  • Batch copy catalog the merged records in MarcEdit (click on MarcEditor icon; Open and navigate to newly merged file):

    1. Do a Reports; Field Count; to check the merge worked; check that field 001, 035, 245 matches correct number of bibs;

    2. Edit/Find various important fields: 100, 245, 502, etc.,

    3. Check for duplicated fields, such as 300 fields; and delete one version.

    4. Edit/Find all =949 fields and ensure they match the number of barcoded items expected

    5. Scan through a few bibs, to look for egregious errors, etc. -- but don’t get lost in the weeds.

    6. Check 520’s for text that is cut off. Where found, add an ellipsis (…) to the end of the field.

    7. Remove extraneous spaces: Edit field data (common fields inc. 100, 245, 520). [must use regex]

      1. Find: _{2,} “_” indicates a single space

      2. Replace: _

    8. Check 65Xs for foreign-language entries.

      1. Easiest if you alphabetize the Find results

    9. Sign your work!

      1. Add/Delete Fields; Field: 909 Field Data: \\$aMelanie Shaw$b2022-05-04$eMODIFIED$dCMS STATS

    10. Save.

    11. Validate the file by going to Reports; MARCValidator or using Control M

      1. Common errors include:

        1. Dollar signs in the 520 → Fix by replacing dollar signs with a {dollar}

        2. When working with economics or business theses, the validator may return an error that the 520 field has an invalid subfield. It is likely that the summary/abstract contains a monetary value, and MarcEdit is interpreting the ‘$’ used to denote USD as a subfield delimiter. If this occurs, please refer to the relevant batch-edit step at the end of this workflow.**

    12. Save file again.

    13. File; Compile into MARC format with a new name: ThesesMERGED-noverDONE-year.mrc

  • Load file into Sierra and track:

    1. Use the first load table: LOAD/Overlay bibs & create items (.briefbiboverlay), and remember to click on the square with Use Review Files” before starting the test & load.

      1. do a Test load first; check to make sure the number of bibliographic records and the number of item records is correct (Note: if theses have more than one copy or volume, these numbers will differ)

      2. if the test goes through with all the records; Load.

      3. Go to Create Lists, then choose an empty line with enough room for your load; click Copy icon; and find the file, which will look somewhat like this:

        • LOAD: inserted files for ThesesMERGED-nooverDONE-year (6)

        • It will ask Do you want to remove the file being copied from? Yes, you do.

      4. Double-click on the new list and check a few bibs to make sure the records loaded properly.

      5. Leave your Load list in Create Lists for later Global Edit to the item records in Sierra.

      6. Leave the Load list in Create Lists for later Global Edit in Sierra.

  • Follow the process for Global Edits to change the 090 to 099 in the item record : https://usulibrary.atlassian.net/l/cp/z0cUb0Ur (note: this is a restricted workflow)

  • Titles with multiple copies per record

    • Use list of item records created for 090 to 099 change.

      Derive lists of items that have the same copy number -- e.g., a list of all “copy two” records – by searching for whose 099 field has “, c.X” where X is the copy number

      Global updates:

      ·         Edit variable-length field: 099[i] “, c.X” => “[blank]”

      ·         Edit fixed-length field: Copy number: 1 => X

      These two edits can be made simultaneously.

  • **Fixing $ error:

    Use the newly created bibliographic-type review list to perform a batch update.

    • Create a second bibliographic-type list by searching the first list with the added condition:

    Bibliographic  MARC Tag 520  matches  "|[0-9]"

    • In the Global Update function, search for the second review list

    • Perform a “Variable-length field edit” on the 520 fields, changing all “|” to “$”

    • Finally, because this will convert the initial subfield A delimiter to a “$” as well, perform a second ““Variable-length field edit” to change all “$a” to “|a”

    • Check a couple of records as normal to verify no errors were created in the process

  • Return to the MERGE view in the Thesis Remediation Airtable & record the Date Loaded in Sierra.

    1. In the Sign-Up Tab, change the following:

      1. “Merge Batch Loaded Status” to Completed

      2. “Merge Batch Load Started” and “Merge Batch Load Ended” dates are filled in with the correct information

      3. Add any notes that will be needed for future in the Batch Loaded Notes field (for example, if there were issues encountered or records that need to be reviewed by someone)

      4. “Last Completed Step” column to “Merge Batch Loaded” UNLESS the “Remaining Items to be loaded into Sierra” value is 0 (zero) - in which case, change to “Complete”

Global Update for Thesis Batch Load link: https://usulibrary.atlassian.net/wiki/spaces/ULC/pages/1497038917

 Done!

II. Detailed Procedures for OVERLAY (with Merge) workflow

Undergoing revision, please wait until finished

Overlay_DTT_Coversion_Templant.mrd for conversion to MARC:

Used for overlaying a Sierra record for thesis/dissertation which was actually cataloged for a print format thesis.

  1. Spreadsheet manipulations:

    1. Go to AirTable Thesis Data Pull

    2. Click on the Sign Up tab, Batch Loading view; and sign up for a theses year

      1. Batch Loader: you

      2. Batch Load Status: In Progress

      3. Fill in Batch Load Start Date and (if needed during the process) Notes.

  2. Go to Title List and the view: Overlay Batch Work

  3. Check all essential fields are filled:

    1. Flag any problems (incomplete, problem, etc.)

    2. Check for missing barcodes in 949s (do not select)

    3. Mark all non-problem items within a year (if possible) and download CSV file of items.

    4. Checkmark the column Batch Cataloging in Process for the chosen year.

    5. Open Excel; choose All Files; navigate to CSV file; choose Delimited; tab; & Save as Excel File with new name, (something explanatory like: Theses-mergeOVERLAY-year).

    6. Delete all but the chosen year, if not done already.

    7. Column work:

      1. Delete: Thesis column with text in the format of year-number-Author in the cells)

      2. Delete: Batch Cataloging in Process

      3. Delete: OCLC #

      4. Keep: OCLC w Prefix

      5. Keep all MARC tag columns (note: Excel will have unhelpfully changed 099 to 99, keep it anyway).

      6. Delete: Batch Type

      7. Delete: Date loaded

    8. All these columns should remain (including blank columns):

      1. OCLC w Prefix - in this format, (OCoLC)#### [will serve as 035]

      2. 099

      3. 100

      4. 245

      5. 264

      6. 300

      7. 500

      8. 500-2

      9. 500-3

      10. 502

      11. 520

      12. 590-1

      13. 590-2

      14. 690-1

      15. 690-1a

      16. 690-2

      17. 690-3

      18. 690-4

      19. 907 (for overlaying)

  4. Save the Excel OVERLAY file as Text (Tab Delimited); new name: Theses-mergeOVERLAY-year

  5. Double quotation marks protection:

    1. To separate and protect actual quoted notes from CSV escape symbols:

      1. Click on the corner of the spreadsheet to select the entire file;

      2. Go to Find and Select; then Replace;

      3. Find what: " (double quote)

      4. Replace with: ^ (inverted caret).

      5. Save file.

  6. Convert to a file of MARC records using MarcEdit with our AirTable data:

    1. In MarcEdit; Tools; Delimited Text Translator;

      1. Source File: Theses-mergeOVERLAY-year

      2. Output File: [new name such as Theses-mergeOVERLAY-Brief-year

    2. Click Import File (formerly Next);

      1. Then: click on Load Template; navigate to Overlay_DTT_Coversion_Templant.mrd (downloadable template is available at the beginning of this procedure )

      2. Fields will load automatically in the Arguments window.

      3. Check the fields against the list in step c, to make sure if they loaded correctly (only the MARC field/1stsubfield will show.

    3. Or, you can manually enter in each column (MarcEdit calls them fields) and map them to MARC:

      1. Select or type: MARC tag; 1st subfield; and Indicators; (no terminal punctuation needs to be added).

        1. Fields for checking or error resolution in blue:

          1. 035$a            Indicators: \\                      no terminal punctuation

          2. 099$a            Indicators: \\                      no terminal punctuation

          3. 100$a            Indicators: 1\                     no terminal punctuation              

          4. 245$a            Indicators: 10                     no terminal punctuation

          5. 264$c             Indicators: \0                     no terminal punctuation              

          6. 300$a            Indicators: \\                      no terminal punctuation

          7. 500$a            Indicators: \\                      no terminal punctuation (all of these)

          8. 502$b            Indicators: \\                      no terminal punctuation

          9. 520$a Indicators: \\                      no terminal punctuation 

          10. 590$a Indicators: \\ no terminal punct. (all of these)

          11. 690$a            Indicators: \\                      no terminal punctuation (all of these)

          12. 907$a           Indicators: \\                      no terminal punctuation (overlay b-number)

        2. Click Add Argument to enter each mapping. The MARC fields will show up in the Arguments preview;

          • right-click to Delete or Edit.

        3. Click Finish.

  7. Edit the Brief Records in MarcEdit:

    1. In MarcEdit; open MarcEditor; Open, then navigate to the new MARC mnemonic (.mrk) file

    2. You can doublecheck the mappings by looking over the first mock record in the file to see that the tags and mapping match.

      1.  

  8. Go to Reports; Field Count to make sure the number of records is correct

    1. (records number should be the numbers of the 100, 035, & 245 field count; HOWEVER, the first mock record will be counted: so subtract 1 for the correct number)

    2. Use Edit; Find; Replace to get rid of CSV double-quotes.

      1. Find: " (one double quote)

      2. Replace: (with nothing)

      3. Replace all to clean up the entire file.

      4. Then fix any 500 quoted notes (which were protected back in the spreadsheet by replacing their quotation marks with carets):

      5. Find: ^ (inverted caret)

      6. Replace: " (double quote)

      7. Like so:

        • =500 \$aInstitution name listed as ^Utah State Agricultural College^ on the title page.

        • =500 \$aInstitution name listed as "Utah State Agricultural College" on the title page.

      8. Then File; Save (it’s a good idea to do a save after each major edit).

    3. Use Edit; Find on each field to check for errors

      1. Check the subfield d’s in the 100 fields which often have problems:

        1. Edit; Find: =100

        2. Edit; Find; Replace

          • Find what: $d$e ← which won’t occur anywhere else in the records.

          • Replace with: ,$e ← [comma]$e

      2. Edit; Find; Find All on =245 for each article (as below) to check for titles with articles that are missing a skip indicator. This is usually caught by MarcEdit, but sometimes…

        • =245 10$aA ← add a single [space] after each to avoid title words starting with 'A'

        • =245 10$aAn ← add a space

        • =245 10$aThe ← add a space

      3. fix any articles found using Edit; Find; Replace:

        1. Find: =245 10$aThe add a space

        2. Replace: =245 14$aThe ← add a space

    4. Add our local constant Data:

      1. Field: 590, field text: \\$aThesis Remediation Project.

      2. Field: 655 field text: \7$aAcademic theses$2lcgft

      3. Field: 710 field text: 2\$aUtah State University.

    5. Scroll down to look for egregious errors in our locally generated (variable and local constant data) fields, especially for our descriptive/searchable tags, but don’t get lost in the weeds.

    6. Then Save the file of cleaned up brief records.

  9. Batch-search Connexion with OCLC numbers:

    1. In Excel and Notepad, create a text file of OCLC numbers for the batch search in OCLC Connexion.

      1. If not already done, copy/paste OCLC numbers to a new column; replace (OCoLC) with * then name the column: OCLC batchsearch.

      2. Select all the OCLC numbers; right-click; choose format cells; & change Category to text.

      3. Copy and paste the cells in the OCLCbatchseach column to Notepad; save as text; with an easy to find title, in this format: OCLCnosOV-year

      4. Note the number of *OCLC rows (records).

  10. In Connexion, go to File; Local File Manager; choose an empty Local Save File path (or make a new one).

    1. Do a search/return key to make sure there are no records lingering in there;

    2. Delete any lingering records.

  11. Go to Batch; Enter Bibliographic Search Keys; Import.

    1. Navigate to your text file of *OCLC numbers, make sure the Default Index is: None, and click Open.

    2. Always click Don’t Delete, when asked.

    3. The text file should load into the Bibliographic Batch Search Window; click Save; then close window.

    4. In Local File Manager; Set up Authorizations; add Authorization & Password; Save & Login.            

    5. Go to Batch; Process Batch;

      1. make sure the empty Local Save File is still selected (if not, select it now).

      2. Only Online Searches should be clicked.

      3. Click OK;

      4. The OCLC numbers should scroll down the view window, until all have been searched.

      5. Finally, a Batch Search Report will appear; go down to the last save number in the report; it should equal the number of *OCLC row numbers that you wrote down earlier.

    6. The Local Save File should have received all the records.

      1. Go to Search; Local Save File; then hit Search, leaving the search box blank.

      2. Select every record in the batch save file.

      3. Export; name the export file, something like this: THESES-OCLCbatchsearchOver-year.

  12. Open MarcEdit; click on MARC Tools icon; leave Select Operation as MarcBreaker;

    1. Open (choose All Files): Theses-OCLCbatchsearchOver-year

    2. Save As: Theses-OCLCbatchsearchOVER-year

    3. Leave Default Character Encoding as MARC8

    4. Click Execute; then click on Edit Records:

      1.  

    5. Edit; Find/Replace: =049 \\$aUUSA to =049 \\$aUUSP

    6. Delete these fields which we will be replacing with our updated versions:

      1. Tools; Add/Delete Fields, fill in the tag in Field; and leave Field Data blank; then click Delete Field:

        1. Field: 100

        2. Field: 245

        3. Field: 260

        4. Field: 264

        5. Field: 300

        6. Field: 502

        7. Field: 520

      2. For the following Fields, we will be deleting certain duplicate versions, using Add/Delete Fields:

        1. Field: 655 Field Data: \7$aAcademic theses.$2lcgft

        2. Field: 655 Field Data: \4$aAcademic theses.

        3. Field: 655 Field Data: \4$aAcademic dissertations.

        4. check for other versions of 655 Academic theses headings to delete, but don’t remove FAST headings:

          • =655 \7$aAcademic theses.$2fast$0(OCoLC)fst01726453

    7. Check that 504 fields have: Includes bibliographical references (and change any incorrect), but don’t add them where they are missing.

      1. Find: =504 .* [Must select “Use Regular Expressions” option]

      2. Replace: =504 \\$aIncludes bibliographical references.

    8. Save for Merge step.

  13. Merge Brief file and OCLC batch-searched file records together.

    1. Go to Tools, MARC Processing Tools, Merge Records, then navigate to the two files, as in this example:

      1. Source file: Theses-OCLCbatchseach-year.mrk    → (OCLC batch-searched/edited records)

      2. Merge File: Theses-mergeOVERLAY-Brief-year.mrk → (brief records with our data)

      3. Save file: Theses-mergedOVERLAY-year.mrk          → (future file of merged records)

      4. Record identifier:  035$a → (Important Matchpoint)

    2. Click Next to begin selecting the fields to merge.

    3. Select/type each field to be merged into the OCLC batch-searched file; click the Green arrow to select each:

      • 049 UUSP

      • 099  local call number

      • 100 Author (our corrected version)

      • 245 Title (ours)

      • 264 RDA production date (ours)

      • 300 Physical description (ours)

      • 500  Department & transcribed note

      • 502  RDA standardized thesis statement

      • 520 Scrapped summary

      • 590 Project name

      • 655  Academic theses genre term

      • 690   local subject strings

      • 710  university

      • 907  Sierra bib-number for overlay

    4. Then click Next to do the merge.

  14. Batch copy catalog the merged records in MarcEdit:

    1. Reports; Field Count; to check the merge worked (001, 035, & 245 fields should match number of records);

    2. Edit; Find; Find All various fields, search for egregious errors, etc.

    3. Remove extraneous spaces: Edit field data (common fields inc. 100, 245, 520). [must use regex]

      1. Find: _{2,} “_” indicates a single space

      2. Replace: _

    4. Validate

      1. When working with economics or business theses, the validator may return an error that the 520 field has an invalid subfield. It is likely that the summary/abstract contains a monetary value, and MarcEdit is interpreting the ‘$’ used to denote USD as a subfield delimiter. If this occurs, please refer to the relevant batch-edit step at the end of this workflow.

    5. Compile cleaned up merged files into MARC format:     ThesesmergedOVERLAYdone-year.mrc

  15. Load file into Sierra and track:

    1. Use the first load table: LOAD/Overlay bibs & create items (.briefbiboverlay), and remember to click on “use review files”;

      1. Do a Test load first; then if the test goes well; Load.

      2. Go to Create Lists; Copy; find file, looking somewhat like this:

        • LOAD: inserted files for ThesesmergedOVERLAYdone-year (#) ← number of bib records loaded in parentheses

      3. Click on a few records and make sure the records loaded properly.

      4. Leave the Load list in Create Lists for later Global Edit in Sierra.

  16. Follow the process for Global Edits to change the 090 to 099 in the item record : https://usulibrary.atlassian.net/l/cp/z0cUb0Ur (note: this is a restricted workflow)

  17. Titles with multiple copies per record

    • Use list of item records created for 090 to 099 change.

      Derive lists of items that have the same copy number -- e.g., a list of all “copy two” records – by searching for whose 099 field has “, c.X” where X is the copy number

      Global updates:

      ·         Edit variable-length field: 099[i] “, c.X” => “[blank]”

      ·         Edit fixed-length field: Copy number: 1 => X

      These two edits can be made simultaneously.

  18. **Fixing $ error:

    1. Use the newly created bibliographic-type review list to perform a batch update.

      b. Create a second bibliographic-type list by searching the first list with the added condition:

      Bibliographic  MARC Tag 520  matches  "|[0-9]"

      c. In the Global Update function, search for the second review list

      d. Perform a “Variable-length field edit” on the 520 fields, changing all “|” to “$”

      e. Finally, because this will convert the initial subfield A delimiter to a “$” as well, perform a second ““Variable-length field edit” to change all “$a” to “|a”

      f. Check a couple of records as normal to verify no errors were created in the process

  19. Return to the OVERLAY view in the Thesis Remediation Airtable & record the Date Loaded in Sierra.

    1. In the Sign-Up Tab, change the following:

      1. “Overlay Batch Loaded Status” to Completed

      2. “Overlay Batch Load Started” and “Overlay Batch Load Ended” dates are filled in with the correct information

      3. Add any notes that will be needed for future in the Batch Loaded Notes field (for example, if there were issues encountered or records that need to be reviewed by someone)

      4. “Last Completed Step” column to “Overlay Batch Loaded” UNLESS the “Remaining Items to be loaded into Sierra” value is 0 (zero) - in which case, change to “Complete”

 

Done!

 


III.Thesis Batch ORIGINAL Processes

Template for converting from Tab-Delimited Text (.txt) document to Mnemonic Mark (.mrk) document:

Note: This template (and all others) will only work if the columns in the .txt document are ordered exactly as included in the template.

General Workflow:

MarcEdit original record creation: export AirTable records → Open in Excel spreadsheet and edit → use MarcEdit Tab Delimited Convertor to create MARC records (.mrk) → check/edit MarcEdit Brief Records → edit fixed fields to match variable fields as needed → compile into MARC (.mrc) format for importing into OCLC.

Import/Produce newly created MARC records in OCLC: without logging into OCLC → Run OCLC validation and fix errors as necessary → login to OCLC when done and update holdings → records will receive OCLC numbers → export records to desktop → import batch file into Sierra → Global edit Item Record call numbers to 099 → DONE.

MarcEdit additions to include for Original Record Process

  • 049 – UUSP

  • 33x fields

  • 504 “Includes bibliographical references.”

  • 590 “Thesis Remediation Project”

  • 655_7 Academic theses. |2 lcgft

 

Detailed Procedures for Original workflow: 

UNDERGOING REVISION: please wait before using

  1. Spreadsheet manipulations:

    1. Go to AirTable Thesis Data Pull

    2. Click on the Sign Up tab, Batch Loading view; and sign up for a thesis year.

      1. Batch Loader: you

      2. Batch Load Status: In Progress

      3. Fill in Batch Load Start Date and (if needed during the process) Notes.

    3. Go to Title List and the view: Original Batch Work

    4. Choose just one year at a time to work on (if you don’t, individualized editing will be necessary later on).

    5. Check all essential fields are filled:

      1. Flag any problems (incomplete, problem, etc.)

      2. Check for missing barcodes in 949s (do not select)

      3. Download CSV file of items within a year, or download entire view and delete rows of other years later.

    6. Open Excel; Open other workbooks; on Computer; Desktop; choose All Files; navigate to the CSV file (under title: Title list-Melanie - Original Batch Work, & Save as an Excel File with new name, (something explanatory like: Theses-Original-year).

    7. Delete all but your chosen year in the spreadsheet, if not done before.

    8. Remove any columns no longer needed for the MARC record:

      1. Thesis (year-#-author),

      2. Batch Cataloging in Process,

      3. OCLC # (which will be empty),

      4. OCLC w/ Prefix,

      5. Batch Type,

      6. Date Loaded in Sierra,

      7. Barcode

      8. Status

    9. And keep these columns (all MARC fields):

      • 099

      • 100

      • 245

      • 264

      • 300

      • 500’s

      • 502

      • 504

      • 520

      • 690’s

      • all 949’s (including empties)

    10. Save the Excel Original file as Text (Tab Delimited);

      1. new name: Theses-Original-year.

    11. Double quotation marks protection:

      1. To separate and protect actual quoted notes from CSV escape symbols:

        1. Click on the corner of the spreadsheet to select the entire file;

        2. Go to Find and Select; then Replace;

        3. Find what: " (double quote)

        4. Replace with: ^ (inverted caret).

        5. Save file.

  2. Convert to a file of MARC records using MarcEdit with our AirTable data:

    1. Open MarcEdit: Tools; Delimited Text Translator;

      • Source File: Theses-Original-year

      • Output File: [new name such as Theses-Original-MARC-year.mrk

      •  

  3. Fixed Fields (Leader)

    1. In the Options portion of the Delimited Text Translator, click on the blue Edit LDR/008:

      1. LDR Edits

        1. nam” to “ntm”

        2. “Ia 45e0” to “7i 45e0”

      2. For the Leader, make sure the dropdown says: Book.

      3. While still in the LDR, change nam to ntm and Ia to 7i

        • 00000nam 2200000Ia 45e0

        • 00000ntm 2200000Ki 45e0

          • t = manuscript monograph

          • K = (obsolete OCLC minimal record (should use either 3 or 7, the MARC21 equivalent)

          • a = AACR2 to i=ISBD (used in combination with 040 rda code)

      4. For the 008, copy & paste the blue 008 values into the 008 edit box (replacing year with the 4 digit year):

        • s9999\\\\xx\\\\\\000\0\und\d

        • syear\\\\xx\a\\\\\bm\\\000\0\eng\d

          Be extremely careful in the 008: it is really easy to accidently delete or add a digit, and then the entire file will have faulty 008 fields.

          Click OK.

    2. Click Next to begin the task of selecting each column/field and mapping it to MARC.

  4. Select or type: MARC tag; 1st subfield; and Indicators; no end punctuation needed in this project. 

    1. 099$a            Indicators: \\                      no terminal punctuation

    2. 100$a            Indicators: 1\                    no terminal punctuation              

    3. 245$a            Indicators: 10                     no terminal punctuation

    4. 264$c             Indicators: \0                     no terminal punctuation              

    5. 300$a            Indicators: \\                      no terminal punctuation

    6. 500$a            Indicators: \\                      no terminal punctuation

    7. 500$a            Indicators: \\                      no terminal punctuation

    8. 502$b            Indicators: \\                      no terminal punctuation

    9. 504$a Indicators: \\  no terminal punctuation

    10. 690$a            Indicators: \\                      no terminal punctuation

    11. 690$a            Indicators: \\                      no terminal punctuation

    12. 949$a            Indicators: \\                      no terminal punctuation no terminal punctuation

      1. Note: There should only be one 949 for each copy of the item, e.g., “c.2,” “c.3,” etc.

  5. Click Add Argument to enter each mapping.

    1. The MARC fields will show up in the Arguments preview; right-click to Delete or Edit.

      • Note that Calculate common nonfiling data is checked. This will automatically adjust the 245 indicators for A, An, or The in the first word of a title of a record.

    2. Finish.

  6. In MarcEdit; open MarcEditor; then find the new MARC mnemonic (.mrk) file

    1. Note: To solve issues related to the records' Leaders, compile the .mrk file into an .mrc file. Then, break the new .mrc file and perform the required edits on the resulting .mrk file.

  7. Use Reports; Field Count to make sure the number of records is correct (the first mock record will be counted; subtract 1 for the correct number)

  8. Use Reports; MARCvalidator to check records have been entered correctly during the conversion process:

    1. If errors are found; fix them, if it isn’t major, otherwise, go back and run the conversion again.

  9. Use Edit; Find; Find: " (double quotes) for odd double quotes around entire text in fields where it is not called for.

    • Then use Edit; Find; Replace:

    • Find: "

    • Replace: (with nothing)

    • Replace all

    • Change ^ (caret) back into " (double quotes):

      1. Find: ^ (caret)

      2. Replace " (double quotes)

      3. Replace all.

  10. Use Edit; Find; Find All on each field to check for errors; then add local constant data:

    1. Add 040, field text: \\$aUUS$beng$erda$cUUS

    2. Add 049, field text: \\$aUUSP

    3. Add 590 field text: \\$aThesis Remediation Project.

    4. Add 655 field text: \7$aAcademic theses$2lcgft

    5. Add 710 field text: 2\$aUtah State University.

    6. Scroll down the MARC records and check for GAPS after adding all those fields (which will affect all records):

    7. Once the the adds have gone through without major hiccups, run Reports; MARCValidator, to make sure all is well.

    8. Next, add the 33x RDA format fields, but instead of adding them individually…

      • Add 336

      • Add 337

      • Add 338

    9. Use the RDA Helper in Tools with everything except Add 336, 337, 338, deselected:

      1. It’s a great time saver.

    10. Change ^ (caret) back into “ (double quotes):

      1. Find: ^ (caret)

      2. Replace " (double quotes)

      3. Replace all.

    11. Edit fixed field 008 in first record, if 008 fields come through incorrectly:

      1. This will only edit one record at a time and for that record all the other codes will need to be typed in.

      2. Place cursor at the beginning of the first record, the click on Edit; Insert/Edit 008; and choose Book:

    12. Edit all the values for these areas (leave the backslashes or zeroes in place for all others):

      1. Type: t

      2. Ills: a ← unless there is no illustrations

      3. Cont: bm

      4. Srce: d

      5. Lang: eng

      6. Ctry: xx\

      7. Dates: year of production

      8. Click OK

      9. In order to fix all the other record's 008 fields; use Find/Replace with the old 008 fields copy/pasted from an unedited record in the Find box and the newly edited 008 on the first record in the Replace box.

        • Find: =008[old 008 data from unedited record]

        • Replace =008[newly edited data from edited record]

      10. Look at the recently edited 008 fields by Edit; Find, and scan down the find screen to look for oddities, such as missing indicators or jagged lines showing too short 008s with missing codes.

      11. Also, recheck edited fields by validating: Edit; MARCValidator and fix 008 errors.

    13. Finally, delete the first mock record: Edit; Delete Records; By Range; and type in 1; then click OK:

      • And the mock record will disappear.

  11. Scan through record for egregious errors in any other Fields, especially for descriptive/searchable tags; but try not get lost in the weeds.

  12. Remove extraneous spaces: Edit field data (common fields inc. 100, 245, 520). [must use regex]

    1. Find: _{2,} “_” indicates a single space

    2. Replace: _

  13. Validate

    1. When working with economics or business theses, the validator may return an error that the 520 field has an invalid subfield. It is likely that the summary/abstract contains a monetary value, and MarcEdit is interpreting the ‘$’ used to denote USD as a subfield delimiter. If this occurs, please refer to the relevant batch-edit step at the end of this workflow.

  14. Save the file of cleaned up records.

  15. Compile into MARC: ThesesOriginalDONE-year.mrc

  16. Import new original theses records into OCLC:

    1. In OCLC Connexion:

      1. First time: Set up the File; Local File Manager to create a new Local Save File path. If already created, skip this step.\

        1. Configure the path; Close.

    2. Each time: If the File Path has already been set up:

      1. Any time you are ready to import new records into OCLC, make sure there are no earlier records left in the save file: especially important for this workflow

      2. F3 and return without inputting a search; then delete any bibs found.

    3. In Connexion, but without logging in:

      1. File; Import Records; navigate to record file in MARC21/MARC8.

      2. Click on Import to Local Save File Bibliographic: should show the new File Path for the save file.

      3. Click OK.

      4. Connexion will immediately import the file into the Save File, but since you are not logged in** the records will not yet receive OCLC record numbers.  A Report with the Import results will pop up: it can be printed out (which includes as a PDF), if a record of the results will be needed.

        1. **Note: If this is your first time cataloging offline, you will likely need to set up your offline options:
          1) USU Libraries' OCLC symbol: UUS
          2) Holding symbol: UUSP
          3) MARC institution code: ULA

    4. Bring up the Connexion Save file with F3 and Return key without entering any search terms:

      1. All of the records should be there in the Save file list and can be marked and/or previewed.

      2. Delete the first “dummy” record that was created by Marcedit (the one containing empty fields); unless it has already been removed.

    5. Select all records (Ctrl+A for Windows; Cmd+A for Macs)

      1. Then Edit; Validate entire file:

      2. Fix any errors, or if too many errors, delete entire saved records in OCLC save file, and go back to MarcEdit to fix the problems there; then, redo import with clean records.

      3. After all records validate without errors in Connexion, take a quick check for punctuation errors of one or two bibs, and if all checks out, then:

    6. Log into Connexion and mark entire set of records,

      1. Then: Action; Holdings; Produce/Update Holdings

      2. Each record should be assigned a new OCLC #

    7. Mark and Export entire file to desktop.

    8. Name the file of new records something memorable and easy to pick out of a list like so: ThesesOriginalDONE-year

  17. Load file into Sierra:

  18. Use the first load table; and remember to click on “use review files”; and do a test load first; then if the test goes well; Load.

    1. Go to Create Lists; Copy; find file, looking somewhat like this:

      1. LOAD: inserted files for ThesesOriginal-MARC-year (6)

    2. Click on a few records and make sure the records loaded properly.

    3. Leave the Load list in Create Lists for later Global Edit in Sierra.

    4. Follow the process for Global Edits to change the 090 to 099 in the item record : https://usulibrary.atlassian.net/l/cp/z0cUb0Ur (note: this is a restricted workflow)

    5. Titles with multiple copies per record

      • Use list of item records created for 090 to 099 change.

        Derive lists of items that have the same copy number -- e.g., a list of all “copy two” records – by searching for whose 099 field has “, c.X” where X is the copy number

        Global updates:

        ·         Edit variable-length field: 099[i] “, c.X” => “[blank]”

        ·         Edit fixed-length field: Copy number: 1 => X

        These two edits can be made simultaneously.

    6. Fixing $ error:

      1. Use the newly created bibliographic-type review list to perform a batch update.

      2. Create a second bibliographic-type list by searching the first list with the added condition:

      Bibliographic  MARC Tag 520  matches  "|[0-9]"
      iii. In the Global Update function, search for the second review list

      iv. Perform a “Variable-length field edit” on the 520 fields, changing all “|” to “$”

      v. Finally, because this will convert the initial subfield A delimiter to a “$” as well, perform a second ““Variable-length field edit” to change all “$a” to “|a”

      iv. Check a couple of records as normal to verify no errors were created in the process

    7. Return to the ORIGINAL view in the Thesis Remediation Airtable & record the Date Loaded in Sierra.

      1. In the Sign-Up Tab, change the following:

        1. “Original Batch Loaded Status” to Completed

        2. “Original Batch Load Started” and “Original Batch Load Ended” dates are filled in with the correct information

        3. Add any notes that will be needed for future in the Batch Loaded Notes field (for example, if there were issues encountered or records that need to be reviewed by someone)

        4. “Last Completed Step” column to “Original Batch Loaded” UNLESS the “Remaining Items to be loaded into Sierra” value is 0 (zero) - in which case, change to “Complete”

 Done!

Statistics for Procedure 4

 

Final procedure 5 for project clean up:

Draft procedures.

Cleanup workflow A. MICRO FORM THESES bib with only a MICRO FORM THESES item record attached

microformtheses3 review file (for now)

  1. Go to Sierra Catalog, and search by both title and author (in case of typos in original bib) to find all bib formats:

    • Ignore any Online bibs.

    • If there is a physical bib, go down to workflow B

    • If there is no physical bib, check Airtable base to see if that particular title has been set aside for separate work: https://airtable.com/appUfDCSIMhNvaqGp/tblSEK4W7ICh0KRP4/viwzjFFLJVBceDqIp?blocks=hide

      • Check both Masterlist-DO NOT CHANGE view, and

      • Cataloged Separately view, and

      • Sign up tab, and for that year, open year record, open hidden, click show all linked records, and scroll down them looking for your author, click on record if there, to open it

    • If it has been loaded, check the microform record for a physical copy for the ithes location with this kind of call number: THESES year no. xxx

      • Check Airtable base view for All to see if there ever was a physical archival copy in SCA.

      • If the title in question is stuck somewhere in the process, and was never loaded, go on to next title in review file.

  2. But, If no record of a physical copy is found anywhere in Airtable:

    • Go on to next step

    • Edit the Sierra Bibliographic Record:

    • Once certain that there is no physical copy of the microform thesis, start changing the record into a microfiche format record:

    • 001 add prefix of “micro and postfix of “nmic

      • Like so: 001 micro12345nmic

      • If the 001 already has micro654321nomatch or micro12345nmic do not change it--just check that Bib Material Type is b, and that the 33x fields are correct for microform microfiche, then remove it from the Create Lists review file as “done”

    • Bib Sierra Material Type: b (microform)

    • Bib 008, expand: ItemForm: b (microfiche) <--fixed field specific format code

    • 300 should generally not be edited, unless egregiously wrong.

      • For instance: if it’s the wrong microformat: 1 microfilm reel

      • 300 can have just the pagination, or xxx “microfiche” in addition to pages; both of these are just different rules over time.

    • Copy paste over top of the complete text of 33x fields, if different from these:

      • 336 text|btxt|2rdacontent

      • 337 microform|bh|2rdamedia

      • 338 microfiche|bhe|2rdacarrier

    • Delete 500 or 590 Library’s copy on microfilm. <--optional step; it’s always microfiche for theses.

    • Item record will already have these values (but check anyway):

      • Item Type should already be 67 (microform)

      • Location: amcbt (microform cabinets)

      • Call Number 099 MICRO FORM THESES

  3. Congratulations!

    1. Statistics: count as 1 Sierra Edit, if you made any changes to the bib.

  4. Remove record from named Create Lists review file: find your title; select it; then click Remove:

Cleanup Workflow B: Sierra has 1 new Theses Remediation Project bib (with more than 1 item):

THESES-done Create list review file

  1. In Sierra the new project bibs will have these markers:

    • 049 UUSP

    • 099 THESES COLL [year] [thesis number].

    • AND a 590 for Thesis Remediation Project.

    • There should also be at least 1 item record for the project:

      • 099 THESES COLL year no.

      • Location: ithes

      • Item Type: 57

    • Multiple item records are on this one, including one for a physical volume in the Barn, which should stay on this bib

    • If all of these are MISSING, go to another workflow

Next step: copy the THESES COLL bib which will be turned into a micro form theses bib:

  1. In Catalog; click Edit bib; then Edit; and Copy Record; and if Sierra asks,

    1. choose default (b)

    2. Warning: the new bib will be identical; I tend to check I’m now in the copied bib by looking in Summary: there should be NO item records as yet

  2. Start editing the copied bib:

    1. The new MICRO FORM THESES bib will need these edits:

      1. Bib Sierra Material Type: b (microform)

      2. Bib 001 add prefix of “micro” and postfix of “nmic

      3. 001 micro111111nmic

      4. Bib 008; right-click; Expand: FormItem: b (microfiche)

      5. Bib 049 UUSO

      6. Bib 099 MICRO|aFORM|aTHESES

      7. 300 should generally not be edited, unless egregiously wrong.

        • if for instance, it’s the wrong microformat: 1 microfilm reel

        • It can be the pagination of original volume, or xx microfiche or a variation of those…

    2. 33x fields: copy/paste the complete text over the bibs fields, if different (or add if missing):

      1. 336 text|btxt|2rdacontent <--usually this is correct, but sometimes there are no 33x’s

      2. 337 microform|bh|2rdamedia

      3. 338 microfiche|bhe|2rdacarrier

  3. Save the MICRO FORM THESES bib and rerun the title search to bring up all versions

  1. Select the MICRO FORM THESES item record: Edit; Transfer Attached

    1. Do the title search again; then choose MICRO FORM THESES Bib

    2. And in Summary click on Use Bib

  • This what we want to happen, so click OK

  1. Go back to the micro form theses bib

    1. And check Item Record:

  • Item Type: 67 (microform)

  • Location: amcbt

  • 099 MICRO FORM THESES

  • Sign record the new Bib record with 909:

    • y (Misc.) 909 Melanie Shaw|b2024-12-02|cnot sure what to put here|dCMS STATS

  1. Hurray it’s done!

  2. Statistics: 1 original catalog?

  3. Go back to Create List review file: find title; and Remove that fixed bib.

 

Cleanup workflow C. If there’s 1 bib and it is NOT a Theses Remediation Project or edited MICRO FORM THESES only bib:

  1. Double check for the Theses Remediation Project bibs looking for these markers:

    1. 049 UUSP

    2. 099 THESES COLL [year] [thesis number].

    3. AND a 590 for Thesis Remediation Project.

    4. There should also be at least 1 item record for the project:

      • 099 THESES COLL year no.

      • Location: ithes

      • Item Type: 57

      • If no markers are in the bib record, but there is an item record for it, then skip step 2 (checking Airtable) and go To copy catalog a missing bib for the Thesis archival copy.

  2. Go to https://airtable.com/appUfDCSIMhNvaqGp? Theses Remediation Project Airtable, then Master List--DO NOT CHANGE (also, be careful not to change anything)

    1. Search for the theses in question by title or author:

    2. If found, Expand record and scroll down to Hide 93 fields (click to un-hide), then search for Status: Not Found

      1. If physical copy Status is Not Found, go back to Cleanup Workflow A and use that to make a Microform only bib.

To copy catalog a missing bib for the Theses Collection copy
  1. Add to the Sierra bib for the microform record:

    • 001 micro11111nmic

    • Add prefix and postfix to 001 micro11111nmic version.

    • change 33x fields to microform/microfiche versions:

    • 337 microform|bh|2rdamedia

    • 338 microfiche|bhe|2rdacarrier

    • Go onto next step in OCLC.

  2. Find the OCLC bib and add these fields to the OCLC record, but if there is already an item record for the ithes item, attached to the microform bib, do not add a 949 field:

  • 500 Department: WHAT EVER IT IS.

  • 590 Thesis Remediation Project.

  • 655 7 Academic theses.|2lcgft

  • 690 Thesis. U--- DEPARTMENT. <--whatever USU is called in that year.

    • Link to USU name chart is here:

  • 690 Utah State University Theses.

  • Do not add a 907 - we are not overlaying anything.

  • 909 Sign your record

  1. Export into Sierra as a new record.

  2. Bring up both bibs by title in Sierra, and transfer archival Thesis item record (if it exists) to the new print bib.

    1. Choose the THESES COLL bib first to bring up the item record to move

    2. Edit-->Transfer; search title again.

    3. Choose the new micro form theses bib and click Use Bib

  3. Ta da!

    1. Statistics: 1 Sierra Edit and 1 copy catalog

  4. Go to the Create Lists review file: find the title; and Remove it.

 

 


 

DO NOT USE: For bibs with no overlay, no print records: This procedure has already been done (kept for historical purposes):

  1. double check there are no print records in list (maybe export to spreadsheet and check items location/callno & bib 001):

Create List review file: microformthese-micro-AndNot0, exported to Excel:

  1. Global Edit (use extreme caution)

    1. Make sure Review is selected in dropdown

    2. Find the review file, in this case: microformthese-micro-AndNot0

c. Click Search to bring it in

  1. Click Command input

    1. then Add

      • Match Whole Field

      • uncheck (use displayed field)

      • r 337 b b unmediated|bn|2rdamedia

      • r 337 b b microform|bh|2rdamedia

    2.  

c. Click OK

r 337 unmediated|bn|2rdamedia → r 337 microform|bh|2rdamedia

Note: I filled in all the Replace values also, because otherwise, Sierra couldn’t find them…

  1. Double check for any typos:

  1. Click on Preview:

  2. Double check that it is indeed doing what it should, in the preview window:

  1. Then Process

  1. If you did the 337 separately, go back to Select record and click Search to bring the bibs back in again to do the matching 338 fields…

    1. Click Add again and fill in the values for the 338.

    2. r 338 unspecified|bzu|2rdacarrier → r 338 microfiche|bhe|2rdacarrier

    3. Preview

    4. Process

    5.  

       

  2. If necessary, between Global edits, go back to review list to look at the 337 or 338 terms and subfields needing different terms to change

    1. …and keep Global Editing until there’s just a handful to fix.

  3. To clean out the few remaining (18 bibs for these):

    1. go back to the same review file and add a line to the search for removing the fixed bibs.

    2. In this review file, there were just some odd 338 terms left, so I restricted it further to those missing the correct term “microfiche” there:

  4. Here are the 18 bibs:

    1.  

  5. At which point, I just manually edited them in the Create Lists review file (because they were an odd group of errors).

  6. Count each bib edited (but not each edit within them) as Sierra Edit in statistics (9194 in this batch).

  7. Just a bit complicated, but better than having someone manually editing them.