Search in Khmer does not work well. Autosuggest works. #22

Open
eroux opened this issue Dec 14, 2024 · 2 comments

@eroux
Contributor

eroux commented Dec 14, 2024

សម្មាសម្ពុទ្ធ appears in autosuggest but gives zero hits.

@roopeux
Collaborator

roopeux commented Jan 28, 2025

@eroux
The khmr-lenient analyzer returns an empty token list, probably for all keywords.

For example, prefLabel_khmr of MW1FEMC010001 is "មហាវេស្សន្តរជាតក"

Test the analyzer:

POST /bdrc_prod/_analyze
{
  "text": "មហាវេស្សន្ដរជាតក",
  "analyzer": "khmr-lenient"
}

# returns an empty list

whereas something like the phonetic analyzer...

POST /bdrc_prod/_analyze
{
  "text": "Khyentse",
  "analyzer": "ewts-for-english-phonetic"
}

...returns a proper token list
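
To narrow down where the tokens disappear, the same request can be sent with `"explain": true`, which asks the `_analyze` API for the output of each stage (tokenizer, then each token filter). The sketch below is only an illustration under assumptions: the helper names are hypothetical, `base_url` is whatever host serves `bdrc_prod`, authentication is not handled, and the response shape read by `tokens_per_stage` is assumed to follow the documented `detail` layout of the analyze API.

```python
import json
from urllib import request


def build_analyze_body(text, analyzer, explain=False):
    """Build the JSON body for POST /<index>/_analyze."""
    body = {"text": text, "analyzer": analyzer}
    if explain:
        # Ask the API for per-stage output instead of only the final tokens.
        body["explain"] = True
    return body


def post_analyze(base_url, index, body):
    """Send the request and return the decoded JSON response (no auth handling)."""
    req = request.Request(
        f"{base_url}/{index}/_analyze",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return json.load(resp)


def tokens_per_stage(response):
    """Return [(stage_name, [token, ...]), ...] from an _analyze response."""
    if "tokens" in response:
        # Plain response (no explain): only the final token list is available.
        return [("analyzer", [t["token"] for t in response["tokens"]])]
    detail = response.get("detail", {})
    stages = []
    if "analyzer" in detail:
        # Assumed shape for a built-in analyzer: a single combined stage.
        stages.append(detail["analyzer"])
    if "tokenizer" in detail:
        stages.append(detail["tokenizer"])
    stages.extend(detail.get("tokenfilters", []))
    return [(s["name"], [t["token"] for t in s.get("tokens", [])]) for s in stages]


def first_empty_stage(response):
    """Name of the first stage that emits no tokens, or None if every stage emits some."""
    for name, tokens in tokens_per_stage(response):
        if not tokens:
            return name
    return None
```

Calling `first_empty_stage(post_analyze(base_url, "bdrc_prod", build_analyze_body(text, "khmr-lenient", explain=True)))` would then name the tokenizer or filter to look at first, assuming the explain output has the shape described above.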

@eroux
Contributor Author

eroux commented Jan 28, 2025

Thanks! I'll investigate that.
