This page is a quick reference guide for the GOKb data loading and ingest process. For more detailed information on each topic, refer to the tutorials linked within the page. If you have not already taken training, contact Jennifer Solomon, GOKb Editor, to set up a time.

Load a file into OpenRefine

  • Open OpenRefine and log into the GOKb extension. Choose Create Project from the left-hand menu. Click Browse and locate the file you want to work with. Click Next. OpenRefine will show you a preview of your data. Scan it to make sure everything looks correct.

...

  • Click Create Project in the top right corner. Your project will automatically open.

Check a File Into GOKb

  • Click the GOKb button located in the top right corner of the screen. Select Check in this project for the first time.

  • You will be asked to provide the Source, Provider, Name, Description, and Notes (optional). Click Save and Check In.


Clean up data in OpenRefine

Use Macros to quickly rename columns

...

  • The following three columns must always be added, and each requires you to look up a controlled value to populate the data: platform.host.name, package.name, org.publisher.name

  • The following columns are optional. Add them if your data contains information for these fields; otherwise, omit them: TIPPPayment, TIPPStatus, Title.OAStatus

  • Address remaining invalid data errors and warnings

  • Review additional fields and load these as custom columns
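The column requirements above can also be checked outside OpenRefine before a file is loaded. The following is a minimal Python sketch, not part of the GOKb tooling: it assumes the file is tab-delimited with a header row, and it takes the column names exactly as they are spelled in this guide. The function names and the sample header (including publication_title) are hypothetical and used for illustration only.

```python
import csv
import io

# Column names as listed in this guide; the exact spelling GOKb expects
# should be confirmed against the linked tutorials.
REQUIRED_COLUMNS = ["platform.host.name", "package.name", "org.publisher.name"]
OPTIONAL_COLUMNS = ["TIPPPayment", "TIPPStatus", "Title.OAStatus"]


def read_header(text, delimiter="\t"):
    """Return the first row of delimited text as a list of column names."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return next(reader, [])


def check_columns(header):
    """Return (missing required columns, optional columns that are present)."""
    present = set(header)
    missing_required = [c for c in REQUIRED_COLUMNS if c not in present]
    optional_present = [c for c in OPTIONAL_COLUMNS if c in present]
    return missing_required, optional_present


# Hypothetical sample data: one header row and one data row.
sample = (
    "publication_title\tplatform.host.name\tpackage.name\tTIPPStatus\n"
    "Example Journal\tExample Platform\tExample Package\tCurrent\n"
)

missing, optional = check_columns(read_header(sample))
print("Missing required columns:", missing)
print("Optional columns present:", optional)
```

To check a real file, read its contents with open(path, encoding="utf-8").read() and pass the text to read_header. A column reported as missing still needs to be added and populated in OpenRefine as described above.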

Ingest a Project Into GOKb

  • In the left-hand navigation pane of OpenRefine, navigate to the Errors tab. Click the Update GOKb pane, then click Proceed with Ingest. Make any necessary changes, then click Save and Check-In.

  • Wait until the project is 100% ingested before moving on to the GOKb web app.

Verify a package record

  • Use the Search>Packages menu to locate the package you are working with.

  • Confirm that your package name is correct. If you think that people might search for the package using a term that isn't contained in the name, add a variant name.

...