Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Click 'OK'. This will populate a new column with a number representing the number of matching lines found in ListB
    • If the "ListB Comparison" column contains a zero (0) then no match has been found
    • If the "ListB Comparison" column contains a one (1) then a single match has been found
    • If the "ListB Comparison" column contains a two (2) then two matches have been found
    • etc.
  • Facet on the new 'ListB Comparison' column to find those lines in ListA that do not appear in ListB (a zero in the column)
  • To identify journals that are in ListB but not in ListA, the same process is carried out starting with the 'title.identifier.issn' column in the 'ListB' project

Anchor
comparison-with-title-eg
comparison-with-title-eg
Comparison with titles example

  • Two lists of journals have been used to create Refine projects (ListA and ListB), and both projects have been edited so they contain a column named 'title.identifier.issn' containing ISSNs and a column named 'PublicationTitle' containing journal titles
  • Open ListA, click the 'title.identifier.issn' column header and choose 'Edit Column' -> 'Add column based on this column ...' from the dropdown menu
  • In the 'New column name' box enter a column name such as "ListB Comparison" (any name will do)
  • In the 'Expression' box enter the text:
    • forEach(cell.cross("ListB","title.identifier.issn"),r,r.cells.PublicationTitle.value).join("|")
  • Click 'OK'. This will populate a new column with all matching titles found for the relevant ISSN, with a 'pipe' character | between each title. The pipe character was chosen simply because it is unlikely to appear in any journal title. Some other separator by amending the expression above. If there is no match the relevant cell will be blank
  • To find all the rows in ListA which found no match in ListB, click on the column header drop down in the new "ListB Comparison" column and choose 'Facet' -> 'Customized Facets' -> 'Facet by blank'
    • In the resulting Facet those rows found by the 'true' facet (i.e. there is a blank in the column) are the ones which found no match

...

forEach(cell.cross("ListB","title.identifier.issn"),r,r.cells.CoverageNotes.value).join("|")

Anchor
comparison-iterative-match-eg
comparison-iterative-match-eg
Comparison using iterative matching example

While ISSN and eISSN are good match points due to their consistency and prevalence in journal lists, it is not unusual in a single list for some titles to have an ISSN but no eISSN and vice-versa. It is also not unusual for some titles to have neither an ISSN or an eISSN. Because the 'cross' lookup relies on finding matches, if in one list Journal Title A only has an ISSN, and in the other list Journal Title A only has an eISSN (in a different column) then a comparison would suggest (falsely) that Journal Title A was missing from the second list. Therefor it is sometimes useful to run matches for several columns across the two lists before looking at the outcome of the comparison.

...