Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

These procedures outline the process for two batch merge workflows that combine item level information collected in an Airtable base with existing OCLC records - as part of a large scale remediation project for Congressional Hearings.

...

General Brief to OCLC Merge Workflow

MarcEdit Brief record creation for merging: export fromAirTable → Open in Excel spreadsheet and edit → use MarcEdit Tab Delimited Convertor to make brief MARC records (.mrk) → check/edit MarcEdit Brief Records → save for merging.

OCLC records batch-search and edit: change (OCoLC) prefix to * → copy entire OCLC# column → save in notepad → load notepad txt file in OCLC batch processor → point to empty OCLC local save file → run OCLC Batch Process search → export all searched records → convert OCLC file to MarcEdit mnemonic (.mrk) → delete certain fields → save for merging → Recopy the OCLC numbers with * prefix → Go back to OCLC and run the batch process to set holdings in OCLC.

Merge brief & OCLC batchsearched records together: batch edit → compile into MARC (.mrc) format → load into Sierra → make Create List of LOAD → global edit Item call numbers to 086 → DONE!

...

  1. Preprocess:

    1. Jen’s crew has checked for physical items, added barcodes, checked SuDoc numbers, and so on, in the spreadsheet & uploaded to Airtable.

    2. Cheryl will delete MQ barcodes from Sierra before we bring in our Merged Batchload records.

    3. Liz has created 949 fields for each item record in Airtable.

    4. Various preprocessing will be run for items that lack a 001 or 035, or that have wrong OCLC numbers brough over from Sierra to the two Merge views.

    5. 949s for titles with multiple volumeswill to be grouped to be added to a separate workflow.

  2. Excel spreadsheet processing will remove excess columns, change 86 to 099, check for the usual errors, add collection name field, and OCLC batchsearch column for later.

  3. Brief records will be created/edited in MarcEdit:

    1. OCLC # (OCLC Masterpiece change to 035 - match point for OCLC merge)

    2. Title – for error resolution, only - will not be merged into OCLC bibs.

    3. 086 - taken from the first (home) columnand change to 099 - our SUDOC number

    4. 590 field for the Project Name

    5. Record #(Biblio) change to 907 - Sierra bib-number match point (needed only for merge & OVERLAY)- match point for Sierra import

    6. 949 - formed in Airtable - (will contain barcode-from Airtable and SUDOC - from Airtable; plus identical location codes, item type codes, etc.) – EXCEPTION: multivolume titles will be adding 949 during OCLC steps.

  4. Batch Changes to OCLC record will be:

    1. Change 049 from UUSA to UUSH.

    2. General and very modest edits for the usual obvious errors.

  5. Merge Brief bibs with our local data into OCLC bibs:

    1. 035 OCLC Masterpiece

    2. 099 SUDOC

    3. 590 field for the Project Name

    4. 949

    5. 907 (for New Barcodes, Update Sierra, Update Holdings).

Undergoing revision, please wait until finished

1-Detailed Procedures for New Barcodes, Update Sierra, Set holdings: (MERGE Overlay)workflow

For government documents which already have a record loaded in Sierra (usually from Marcive GPO), but needed a new barcode, we will use the barcode to make 949 item record field, put the Sierra bib# in a 907 for overlaying, and the SUDOC in a 099.

AirTable Extraction:

  1. Go to AirTable Hearings 9-10:

    1. Sign up for a certain section of SuDoc Y4 numbers

      1. Batch Loader: you

      2. Batch Load Status: In Progress

      3. Fill in Batch Load Start Date and (if needed during the process) Notes.

    2. Go to view: New barcodes, Update Sierra, Set holdings:

      1. Flag any problems that need to returned for more work, checking etc. (incomplete, problem, etc.)

      2. Download CSV file of items by clicking on the down arrow at the top menu, in one of two ways:

        • First filter by SUDOC stem (easily done by adding a filter of Sign up Name is exactly: [your name]--then delete just that filter when done downloading)

        • OR

        • Download entire set and delete other titles after opening file in Excel (probably too bulky to do).

      3. Distinguish the download by changing the CSV name of Imported table-New barcodes, Update Sierra, Set holdings after downloading, something like this: Newbarcodes-Y4V2-HRG101-103.

Excel Column work:

  1. OpenExcel app first

    1. Open; from PC; Desktop; All Files;

    2. Then, navigate to the CSV file (such as Newbarcodes-Y4V2-HRG101-103); then choose Delimited; Next; click in Comma box, Next; choose Text, and Finish.

    3. If you haven’t done so already by filtering to your SUDOC stem in Airtable, delete all but your chosen Y4 group rows from the spreadsheet. or copy/paste your chosen titles to another spreadsheet.

  2. Delete all these columns in the spreadsheet:

    • The 3 sign up columns: Sign up Name, Batch Cataloging Status, Start Date

    • Final Source

    • Stem

    • Condition

    • Batch Cataloging Status

    • Record #(Item) - we don’t use this

    • 86 (since we’ll be using the first SuDoc Call Number column)

    • OCLC Masterpiece will NOT be deleted: instead delete ANY other OCLC related numbers, including Field 001, OCLC Final for Melanie, Field 001 checked, etc

    • Sierra MQ barcode

    • Barcode added in inventory - make sure this is already in the MARC 949 - $ subfield column, then delete Barcode added in inventory

    • Volume (if multipart) - should be in another view

    • Editable SuDoc

    • Bib Record (linked) - we already have Record#(Biblo as 907)

    • Count of items on Bib

    The remaining Columns should be:

    • SuDoc Call Number

    • OCLC Masterpiece - this will become our 035 field in the next step

    • Title - just for error checking, if needed

    • Record#(Biblio) – (for overlaying in Sierra)

    • MARC 949 - $ Subfield - first item record

  3. Keep these specific columns and renameRename (and edit, as needed) the remaining columns:

    • SuDoc Call Number change to 099$a

    • Title to 245$a - this will be for error checking only

    • Record #(Biblio) change to 907$a

    • MARC 949 - $ Subfield to 949$a

    • OCLC Masterpiece - first do these edits to create two OCLC columns:

      • Insert 2 new columns to the left.

        1. In the second column add the (OCoLC) prefix to the header cell, and then pull down to fill in the rest of the column.

        2. In the first column add the function:

          • =CONCATENATE(F1,G1) ← with F as the prefix column - Gas OCLC Masterpiece

          • click return key

          • First (header) cell should display (OCoLC)OCLC Masterpiece

          • Pull down to populate the rest of the cells in the column with their concatenate functions.

          • Then Copy the entire (OCoLC)OCLC Masterpiece column, and Paste as Values in the same column.

          • Finally, delete the (OCoLC) prefix column and the old OCLC Masterpiece column.

          • Copy the column, then, Insert as new column at the beginning of the spreadsheet

          • Rename

      as
          • : 035$a

        1. Go back to the (OCoLC)OCLC Masterpiece column, then use Find & Select; Replace to edit to OCLC batch-search format:

          • Find: (OCoLC) ← no spaces

          • Replace: * ← asterisk

          • Replace All

          • Rename column

        to
          • : OCLC batchsearch

        1. Now there are two OCLC columns at either end of your spreadsheet:

  4. Save the Excel New Barcodes spreadsheet as Text (Tab Delimited)

    • I usually give it a new name, at this point (but that is optional)

      • NewBarcodes-Y4T68-2-107to49done

MarcEdit Conversion and Edits

  1. Open MarcEdit; then Tools; Delimited Text Translator

    1. Source File:[file name, such as: NewBarcodes-Y4Hearings-Y4-3]

    2. Output File:[new name such as NewbarcodesY4HearingsOverBrief-Y4-3…brief]

    3. Click Import File (formerly Next)

  2. Manually enter in each column (fields) and map them to MARC fields

    1. Using Data Snapshot as a guide

      • Use Select to pick each Data shapshot field

      • In Map To: type MARC tag+1st subfield;

      • Then Indicators are usually the default: \\

        • – except for 245$a which is: 10

      • No terminal punctuation is needed for this project, so leave blank.

      • Then click Add Argument

    2. The field to MARC arguments:

      • Field 0 035a Indicators: \\ no terminal punctuation

      • Field 1 099$a Indicators: \\ no terminal punctuation

      • Field 2 245$a    Indicators: 10 no terminal punctuation             

      • Field 4 907$a Indicators: \\ no terminal punctuation

      • Field 5 949$a Indicators: \\ no terminal punctuation

      • Do not use the OCLC batchsearch column

    3. The MARC fields will show up in the Arguments preview box

    4. you can right-click in the Add Argument preview boxto Delete or Edit if there are any problems

      • Image RemovedImage Added

        you can also select one argument (or more) and use the blue circled arrows to move it up or down

      • If all looks good, click Finish.

  3. Edit the Brief Records in MarcEdit

    • In MarcEdit; open MarcEditor; then File; Open, then navigate to the new MARC mnemonic (.mrc) file

    • You can doublecheck the mappings by looking over the first mock record in the file to see that the tags and mapping match.

      • Image RemovedImage Added

  4. Go to Reports; Field Count to make sure the number of records is correct

    1. (records number should be the numbers of the 001, 035, & 245 field count; HOWEVER, the first mock record will be counted: so subtract 1 for the actual number of records)

      • Image RemovedImage Added
    2. Use Edit; Find; Replace to get rid of Excel’s double-quotes.

      1. Find: " (one double quote)

      2. Replace: (with nothing)

      3. Replace all to clean up the entire file.

      4. Then File; Save (it’s a good idea to do a save after each major edit).

    3. If we were actually using the 245 fields as anything except error checkers, we would fix them; for this project it would be totally optional to fix it.

      • Edit;Find; Find All on =245 for each article (as below) to check for titles with articles that are missing a skip indicator. This is usually caught by MarcEdit, but with all hidden quotation marks…

        • =245 10$aA ← add a single [space] after each to avoid title words starting with 'A'

        • =245 10$aAn ← add a single [space] after each…

        • =245 \0$aThe ← add a single [space] after each…

      • fix any articles found using Edit; Find; Replace:

        1. Find: =245 10$aThe add a single [space]

        2. Replace: =245 14$aThe ← add a single [space]

    4. Do some quick Find’s to check for egregisous errors in our local data.

      1. Edit; Find; Find All

        • Image RemovedImage Added
        • Find: =035

        • Image RemovedImage Added
        • Find: =099

        • Find: =907

          • Edit; Replace

          • Find: =907 \$ab

          • Replace: =907 \$a.b

          • Replace All

        • Find: =949

    5. Now add our project name in the 590 field:

      • Tools; Add/Delete Field

      • Field: 590 Field Data: \\$aY4 Retrocataloging Project 2022.

      • Click Add Field

      • Scroll down to check that the new field lines up with the other fields:

    6. Then scroll down the records to look for egregious errors in our locally generated fields (ignore LDR, 008, & 245), making sure they all line up correctly, but don’t get lost in the weeds.

    7. Then Save the file of edited brief records.

    8. MELANIE HAS EDITED THE WORKFLOW TO HERE

Batch-search Connexion with OCLC numbers:

  1. In Excel and Notepad, create a text file of OCLC numbers for the batch search in OCLC Connexion.

    1. If not already done, copy/paste OCLC numbers to a new column; replace (OCoLC) with * then name the column: OCLC batchsearch.

    2. Select all the OCLC numbers; right-click; choose format cells; & change Category to text.

    3. Copy and paste the cells in the OCLCbatchseach column to Notepad; save as text; with an easy to find title, in this format: OCLCnosOV-year

    4. Note the number of *OCLC rows (records).

  2. In Connexion, go to File; Local File Manager; choose an empty Local Save File path (or make a new one).

    1. Do a search/return key to make sure there are no records lingering in there;

    2. Delete any lingering records.

  3. Go to Batch; Enter Bibliographic Search Keys; Import.

    1. Navigate to your text file of *OCLC numbers, make sure the Default Index is: None, and click Open.

    2. Always click Don’t Delete, when asked.

    3. The text file should load into the Bibliographic Batch Search Window; click Save; then close window.

    4. In Local File Manager; Set up Authorizations; add Authorization & Password; Save & Login.            

    5. Go to Batch; Process Batch;

      1. make sure the empty Local Save File is still selected (if not, select it now).

      2. Only Online Searches should be clicked.

      3. Click OK;

      4. The OCLC numbers should scroll down the view window, until all have been searched.

      5. Finally, a Batch Search Report will appear; go down to the last save number in the report; it should equal the number of *OCLC row numbers that you wrote down earlier.

    6. The Local Save File should have received all the records.

      1. Go to Search; Local Save File; then hit Search, leaving the search box blank.

      2. Select every record in the batch save file.

      3. Export; name the export file, something like this: THESES-OCLCbatchsearchOver-year.

  4. Open MarcEdit; click on MARC Toolsicon; leaveSelect Operation as MarcBreaker;

    1. Open (choose All Files): Theses-OCLCbatchsearchOver-year

    2. Save As: Theses-OCLCbatchsearchOVER-year

    3. Leave Default Character Encoding as MARC8

    4. Click Execute; then click on Edit Records:

      1.  

    5. Edit; Find/Replace: =049 \$aUUSAto =049 \$aUUSP

    6. Delete these fields which we will be replacing with our updated versions:

      1. Tools; Add/Delete Fields, fill in the tag in Field; and leave Field Data blank; then click Delete Field:

        1. Field: 100

        2. Field: 245

        3. Field: 260

        4. Field: 264

        5. Field: 300

        6. Field: 502

        7. Field: 520

      2. For the following Fields, we will be deleting certain duplicate versions, usingAdd/Delete Fields:

        1. Field: 655Field Data: \7$aAcademic theses.$2lcgft

        2. Field: 655Field Data: \4$aAcademic theses.

        3. check for other versions of 655 Academic theses headings to delete, but don’t remove FAST headings:

          • =655 \7$aAcademic theses.$2fast$0(OCoLC)fst01726453

    7. Check that 504 fields have: Includes bibliographical references (and change any incorrect), but don’t add them where they are missing.

      1. Jump to Record to fix incorrect text.

    8. Save for Merge step.

  5. Merge Brief file and OCLC batch-searched file records together.

    1. Go to Tools, MARC Processing Tools, Merge Records, then navigate to the two files, as in this example:

      1. Source file: Theses-OCLCbatchseach-year.mrk    → (OCLC batch-searched/edited records)

      2. Merge File: Theses-mergeOVERLAY-Brief-year.mrk → (brief records with our data)

      3. Save file: Theses-mergedOVERLAY-year.mrk          → (future file of merged records)

      4. Record identifier:  035$a → (Important Matchpoint)

    2. Click Next to begin selecting the fields to merge.

    3. Select/type each field to be merged into the OCLC batch-searched file; click the Green arrow to select each:

      • 049 UUSP

      • 099  local call number

      • 100 Author (our corrected version)

      • 245 Title (ours)

      • 264 RDA production date (ours)

      • 300 Physical description (ours)

      • 500  Department & transcribed note

      • 502  RDA standardized thesis statement

      • 520 Scrapped summary

      • 590 Project name

      • 655  Academic theses genre term

      • 690   local subject strings

      • 710  university

      • 907  Sierra bib-number for overlay

      • 949  automatic item record creation

    4. Then click Next to do the merge.

  6. Batch copy catalog the merged records in MarcEdit:

    1. Reports; Field Count; to check the merge worked (001, 035, & 245 fields should match number of records);

    2. Edit; Find; Find All various fields, search for egregious errors, etc.

    3. Compile cleaned up merged files into MARC format:     ThesesmergedOVERLAYdone-year.mrc

  7. Load file into Sierra and track:

    1. Use the first load table: LOAD/Overlay bibs & create items (.briefbiboverlay), and remember to click on “use review files”;

      1. Do a Test load first; then if the test goes well; Load.

      2. Go to Create Lists; Copy; find file, looking somewhat like this:

        • LOAD: inserted files for ThesesmergedOVERLAYdone-year (#) ← number of bib records loaded in parentheses

      3. Click on a few records and make sure the records loaded properly.

      4. Leave the Load list in Create Lists for later Global Edit in Sierra.

      5. Return to the OVERLAY view in the Thesis Date Pull Airtable & record the Date Loaded in Sierra.

    Report the Load for the Global Update step to the Batch Thesis Cataloger:

    1. Send an email with the subject line: Global Update for Thesis Batch Load.

    2. Give the name of the load in the body of the email, something like this:

      • I have just loaded ThesismergedOVERLAYdone-1926. It is ready to have the item records' call number tags changed from 090 to 099.

      • The item records will be changed from 090 to 099; after the global edit, and the Create Lists file will be removed.

...

Info

Statistics for Merge Overlay Procedures

Report the total number of MARC records (not items) imported into Sierra during this process in the “MARC Batch-Modified” column in the CMS Statistics Airtable base.

Done!

 

New Barcodes, Ready for New Cataloging

Not yet READY: do not use yet

2-Detailed Procedures for New Barcoded, Ready for New Cataloging: (Merge, NO overlay) workflow 

UNDERGOING REVISION: please wait before using

AirTable Extraction:

  1. Go to AirTable Hearings 9-10:

    1. Sign up for a certain section of SuDoc Y4 numbers

      1. Batch Loader: you

      2. Batch Load Status: In Progress

      3. Fill in Batch Load Start Date and (if needed during the process) Notes.

    2. Go to view: Barcodee, Ready for New Cataloging:

      1. Flag any problems that need to returned for more work, checking etc. (incomplete, problem, etc.)

      2. Download CSV file of items by clicking on the down arrow at the top menu, in one of two ways:

        • First filter by SUDOC stem (easily done by adding a filter of Sign up Name is exactly: [your name]--then delete just that filter when done downloading)

        • OR

        • Download entire set and delete other titles after opening file in Excel (probably too bulky to do).

      3. Distinguish the download by changing the CSV name of Imported table-Barcoded, Ready for New Cataloging after downloading, something like this: NewCatalogingY4V2HRG101-103.

Excel Column work:

  1. Open Excel app first

    1. Open; from PC; Desktop; All Files;

    2. Then, navigate to the CSV file (such as NewcatalogingY4V2HRG101-103); then choose Delimited; Next; click in Comma box, Next; choose Text, and Finish.

    3. If you haven’t done so already by filtering to your SUDOC stem in Airtable, delete all but your chosen Y4 group rows from the spreadsheet. or copy/paste your chosen titles to another spreadsheet.

  2. Delete all these columns in the spreadsheet:

    • The 3 sign up columns: Sign up Name, Batch Cataloging Status, Start Date

    • Final Source

    • Stem

    • Condition

    • Batch Cataloging Status

    • Record #(Item) - we don’t use this

    • Record #(Biblio) - we’re not overlaying

    • 86 (since we’ll be using the first SuDoc Call Number column)

    • OCLC Finally for Melanie will NOT be deleted: instead delete ANY other OCLC related numbers

    • Sierra MQ barcode

    • Barcode added in inventory - make sure this is already in the MARC 949 - $ subfield column, then delete Barcode added in inventory

    • Volume (if multipart) - should be in another view

    • Editable SuDoc

    • Bib Record (linked)

    • Count of items on Bib

    The remaining Columns should be:

    • SuDoc Call Number

    • OCLC final for Melanie- this will become our 035 field in the next step

    • Title - just for error checking, if needed

    • MARC 949 - $ Subfield - first item record

  3. Keep these specific columns and rename:

    • SuDoc Call Number change to 099a

    • Title to 245 - for error checking only

    • ADD a new Column:

      • Insert a column; in header name it: 590a

      • In first column add the text: Y4 Retrocataloging Project 2022

      • Pull down to fill the rest of the column cells.

    • MARC 949 - $ Subfield to 949a

    • OCLC final for Melanie/etc.

      1. Insert 2 new columns to the left.

        1. In the second column add an asterisk * to the header cell, and then pull down to fill in the rest of the column.

        2. In the first column add the function:

          • =CONCATENATE(F1,G1) ← with F is asterisk column * Gis OCLC Masterpiece

          • click return key:

          • First (header) cell should display *OCLC final for Melanie…

          • Pull down to populate the rest of the cells in the column with their concatenate functions.

          • Then Copy the entire *OCLC column, and Paste as Values in the same column.

          • Finally, delete * asterisk column and the unadorned OCLC Masterpiece column.

          • Change the new *OCLC column to 035a

          • [image]

  4. Save the Excel New Barcodes file as Text (Tab Delimited)

    • I usually give it a new name, at this point (optional)

MarcEdit Conversion and Edits

  1. Open MarcEdit; then Tools; Delimited Text Translator

    1. Source File:[file name, such as: NewCataloging-Y4Hearings-Y4-3]

    2. Output File:[new name such as NewCataloging4HearingsOverBrief-Y4-3…]

    3. Click Import File (formerly Next)

  2. Manually enter in each column (fields) and map them to MARC fields

    1. Use the Data Snapshot as a guide:

    2. [image]

    3. Click Finish.