-
Notifications
You must be signed in to change notification settings - Fork 1
127 Plazi datasets that are very likely to have classification issues ACC-ACC species (same authors) #362
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
We'll be gradually fixing these in the next couple of weeks |
Dear @camiplata might it be possible to include in your report the Plazi/TB URL of the respective treatment that needs fixing? Right now, you include in the "url" column the URL of the entire article data set. for the first example in the CSV with this would speed up our work d |
So, I've looked through the first 2 reports in the spreadsheet, and here's what I found out:
Is there a possibility that this shortlist was taken not very recently and some things could've already been reported/fixed before? |
|
Yes @myrmoteras I can. Nevertheless, although I may detect the issue with individual names/treatements, it sometimes reflects a pattern affecting several treatments within the same dataset. For example for #360 I noticed the issue with one species name, but upon reviewing the dataset, I found that more species names were affected. These additional names may or may not affect the COL extended release, but they do impact the overall quality of the dataset. Therefore, I believe it is worth reviewing the entire dataset. It may also help to have the CLB link to the dataset tree view, allowing you to quickly assess any incongruities, @flsimoes would it bu useful for you? |
All done! |
This is Awesome Felipe! thank you |
I did a review of the duplicates on accepted names on the extended release of Catalogue of life and found that several of them are caused by PLAZI datasets that have problems within its classification:
See some detailed examples here:
For some of these datasets there may be already an individual issue reported, but for most of them there is not.
Here you can find the list of Plazi's datasets whose classification is very likely to need a fix:
PLAZIduplicates_20241227.csv
The text was updated successfully, but these errors were encountered: