
Name string doesn't parse when subgenus = Incertae sedis; Quality values also look strange #277

@debpaul

Raw data (unparsed): beulah-first-5000-name-strings-unparsed.csv

Modified GNParsed Data Set: beulath-taxonnames-gnparsed-first-5000-rows.txt

  • added a family column, value = Carabidae
  • opened the file in Notepad++
  • changed CRLF line endings to Unix (LF), because upload to TW batch requires this
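
The manual steps above could also be scripted rather than done in Notepad++. A minimal sketch in Python, assuming the GNParsed file is tab-delimited and that the new column header should be `family` (both are assumptions, and `add_family_column` is a hypothetical helper name):

```python
import csv
import io


def add_family_column(text, family="Carabidae", delimiter="\t"):
    """Append a family column to delimited text and return it with LF line endings.

    Assumptions: the first row is a header, the delimiter is a tab,
    and the new header is named "family".
    """
    # newline="" stops Python from translating line endings; csv handles CRLF itself
    reader = csv.reader(io.StringIO(text, newline=""), delimiter=delimiter)
    out = io.StringIO(newline="")
    # lineterminator="\n" forces Unix (LF) endings on every written row
    writer = csv.writer(out, delimiter=delimiter, lineterminator="\n")
    for index, row in enumerate(reader):
        if not row:
            continue  # skip completely empty lines
        writer.writerow(row + (["family"] if index == 0 else [family]))
    return out.getvalue()
```

To apply it to a file, read the input with `open(path, newline="", encoding="utf-8")` and write the result with `open(out_path, "w", newline="", encoding="utf-8")` so that no CRLF endings are reintroduced on Windows.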

Noticed

  • the Quality values look strange. Maybe on import into Excel I need to select a specific data type for this field?
    Image

  • see also line 11 above, where the value pseudoflavipes appears changed to pseudoflavipe0s in the CanonicalFull column (also lines 116, 117)

    • I don't know where that 0 comes from
  • see also the leading and trailing 0 in Author Year; I'm not sure where they come from either
    Image

  • more 0 issues (and a delimiter issue?), origin uncertain
    Image

  • some names did not parse (not sure why); see the next screenshot. Maybe because all these names have subgenus = (Incertae sedis) and GN doesn't recognize this value at this rank?

Image

  • In general, subgenus is missing from all parsed values.
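
One way to narrow down where the stray 0 characters come from would be to inspect the raw text of the file without going through Excel. The sketch below flags values in one column where a 0 sits directly next to a letter, as in pseudoflavipe0s. It assumes a delimited file with a header row containing the column name (for example CanonicalFull); `find_stray_zeros` is a hypothetical helper, not part of GNparser.

```python
import csv
import io
import re

# A "0" directly before or after a letter, e.g. "pseudoflavipe0s"
STRAY_ZERO = re.compile(r"[^\W\d_]0|0[^\W\d_]")


def find_stray_zeros(text, column, delimiter=","):
    """Return (line_number, value) pairs where `column` holds a 0 next to a letter."""
    reader = csv.DictReader(io.StringIO(text, newline=""), delimiter=delimiter)
    hits = []
    for row in reader:
        value = row.get(column) or ""
        if STRAY_ZERO.search(value):
            # reader.line_num counts physical lines, including the header
            hits.append((reader.line_num, value))
    return hits
```

If this reports hits on the file as written by the parser, the zeros were already present before the Excel import. If it reports none, the import step or the delimiter handling is the more likely source.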

Maybe in the future?

  • an option to parse (further atomize) down to the lowest rank provided
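
To illustrate what such further atomization might look like, here is a rough sketch that pulls a parenthesized subgenus out of a name string before or after parsing. It is not how GNparser works. It assumes name strings of the form "Genus (Subgenus) rest of name", uses placeholder names (Aus, Bus, cus) rather than real data, and treats "Incertae sedis" as "no subgenus assigned"; `split_subgenus` is a hypothetical helper.

```python
import re

# Assumed layout: Genus (Subgenus) epithet [authorship]
NAME = re.compile(
    r"^(?P<genus>[A-Z][a-z-]+)\s+\((?P<subgenus>[^)]+)\)\s+(?P<rest>.+)$"
)


def split_subgenus(name):
    """Split a name string into genus, subgenus and the remainder.

    Returns None when the string has no parenthesized element after the genus.
    """
    match = NAME.match(name.strip())
    if match is None:
        return None
    verbatim = match.group("subgenus").strip()
    # Treat the placeholder "Incertae sedis" as an unassigned subgenus
    is_placeholder = verbatim.lower() == "incertae sedis"
    return {
        "genus": match.group("genus"),
        "subgenus": None if is_placeholder else verbatim,
        "subgenus_verbatim": verbatim,
        "rest": match.group("rest"),
    }
```

With this, `split_subgenus("Aus (Incertae sedis) cus")` keeps the verbatim value but leaves `subgenus` empty, so the genus and the remainder could be passed to the parser without the placeholder.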
