resolve_once behavior #32

sckott · 2014-03-03T19:52:29Z

Curious whether the resolve_once behavior is correct. For example, this call http://resolver.globalnames.org/name_resolvers.json?names=Plantago+major&resolve_once=true returns more than one match (all from different sources, I think, but still more than one match).

The API docs suggest that setting resolve_once=true should return only the first match.


dimus · 2014-03-03T20:15:14Z

I guess the option name is a bit confusing. The idea behind it was to avoid name parsing and return only exact matches if possible. In your example only exact matches (from several data sources) are returned, so parsing and matching by canonical form did not happen. This is faster than running the name parser, but it also removes quite a few results, which is why resolve_once is disabled by default. Are you interested in getting one result only?

You will see the difference if you change resolve_once=true to resolve_once=false
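To make the comparison concrete, here is a minimal Python sketch that builds the two request URLs. The helper name `resolver_url` is hypothetical, not part of any client library. Only the endpoint and the `names` and `resolve_once` parameters come from this thread, and no request is actually sent.

```python
from urllib.parse import urlencode

BASE_URL = "http://resolver.globalnames.org/name_resolvers.json"


def resolver_url(names, resolve_once):
    """Build a resolver URL (hypothetical helper, for illustration only)."""
    params = {
        "names": names,
        # The API expects a lowercase string flag in the query string.
        "resolve_once": "true" if resolve_once else "false",
    }
    # urlencode encodes the space in the name as '+'.
    return BASE_URL + "?" + urlencode(params)


exact_only_url = resolver_url("Plantago major", True)   # exact matches only
full_url = resolver_url("Plantago major", False)        # parsing + canonical matching
print(exact_only_url)
print(full_url)
```

Fetching both URLs and comparing the number of results per name should show the difference described above.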

sckott · 2014-03-03T20:33:20Z

Thanks for your quick response! Hmm, I guess just getting one result doesn't make too much sense, so no, I don't think that's needed, and none of the users of my software ask for it.

I'll change the documentation in my software so that the resolve_once parameter is described more accurately.

sckott · 2014-03-04T16:33:42Z

Hi again @dimus - Actually, a user just asked about returning just one match for each queried taxon name. Is that possible? It seems a bit tricky and may require a few options. For example, if the parameter were called return_one, it could pick at random from a set of equivalent names (return_one=random), pick from a preferred data source (return_one=12, 12 for EOL), or offer other options.

We could do this on our side in R, but of course it would make for faster data return times if it were done on your side.
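For reference, the client-side version of the two strategies could look roughly like the Python sketch below. The function `pick_one`, the record fields `data_source_id` and `name_string`, and the sample records are all assumptions made for illustration. They are not the documented response format and not real resolver output.

```python
import random


def pick_one(matches, preferred_source=None, rng=random):
    """Reduce a list of match records to a single record (hypothetical helper).

    preferred_source=None  -> pick at random (the "return_one=random" idea)
    preferred_source=<id>  -> first match from that data source, or None
    """
    if not matches:
        return None
    if preferred_source is None:
        return rng.choice(matches)
    for match in matches:
        if match["data_source_id"] == preferred_source:
            return match
    # Design choice: no silent fallback when the preferred source has no match.
    return None


# Made-up records, only to exercise the function.
sample_matches = [
    {"data_source_id": 4, "name_string": "Plantago major"},
    {"data_source_id": 12, "name_string": "Plantago major L."},
]

from_preferred = pick_one(sample_matches, preferred_source=12)
at_random = pick_one(sample_matches, rng=random.Random(0))
print(from_preferred)
print(at_random)
```

Doing this on the client still transfers every match over the network, which is the cost mentioned above.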

dimus · 2014-03-10T19:46:57Z

Oops, missed your new comment. There is a not-yet-documented way to do something like that:

http://resolver.globalnames.org/name_resolvers.json?names=Plantago+major&best_match_only=true&data_source_ids=12

If no data_source_ids are given, all of them will be used.

In addition, it is possible to add preferred_data_sources to best_match_only. If no data_source_ids are given, the 'best match' will come from any data source. If there is also a match in a 'preferred data source', it will be returned as well.

http://resolver.globalnames.org/name_resolvers.json?names=Plantago+major&best_match_only=true&preferred_data_sources=12|4
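The two request shapes above can be assembled programmatically, as in the Python sketch below. The helper `best_match_url` is hypothetical. It assumes that `data_source_ids` takes the same pipe-separated list as `preferred_data_sources`, which this thread only shows for the latter. No request is sent.

```python
from urllib.parse import urlencode

BASE_URL = "http://resolver.globalnames.org/name_resolvers.json"


def best_match_url(names, data_source_ids=None, preferred_data_sources=None):
    """Build a best_match_only URL (hypothetical helper, for illustration)."""
    params = {"names": names, "best_match_only": "true"}
    if data_source_ids:
        # Assumption: ids are joined with '|' as in preferred_data_sources.
        params["data_source_ids"] = "|".join(str(i) for i in data_source_ids)
    if preferred_data_sources:
        params["preferred_data_sources"] = "|".join(
            str(i) for i in preferred_data_sources
        )
    # safe="|" keeps the pipe literal instead of percent-encoding it as %7C.
    return BASE_URL + "?" + urlencode(params, safe="|")


single_source_url = best_match_url("Plantago major", data_source_ids=[12])
preferred_url = best_match_url("Plantago major", preferred_data_sources=[12, 4])
print(single_source_url)
print(preferred_url)
```

Omitting both lists leaves only `best_match_only=true`, in which case the best match may come from any data source.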

For now only BHL uses this functionality. If you start using it, I will give it 'official' status and document it on the API page.

sckott · 2014-03-10T20:31:33Z

Great, thanks for this, I'll include these two parameters in my taxize R package. I don't know how much people will use them - I'm sure at least some will.

tucotuco · 2014-12-05T17:58:02Z

Looking forward to official status. This will be immensely useful for data improvement workflows in general. VertNet gives a "+1".

sckott mentioned this issue Mar 3, 2014

gnr_resolve potential bug ropensci/taxize#248

Closed

dimus added in progress ready and removed in progress labels Aug 21, 2015
