Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pt] Added and sorted alphabetically words #11103

Merged
merged 2 commits into from
Dec 13, 2024

Conversation

marcoagpinto
Copy link
Member

@marcoagpinto marcoagpinto commented Dec 13, 2024

Added words.

Summary by CodeRabbit

  • New Features

    • Expanded vocabulary in the Portuguese part-of-speech dictionary with new entries such as "Airbus", "Linux", and "rockstar".
    • Enhanced spelling resource with new terms including "acordéon", "aftershave", and "agreement", reflecting a diverse vocabulary.
  • Bug Fixes

    • Removed outdated or questionable entries from both the part-of-speech dictionary and spelling resource.

Copy link
Contributor

coderabbitai bot commented Dec 13, 2024

Warning

Rate limit exceeded

@marcoagpinto has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 2 minutes and 46 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

📥 Commits

Reviewing files that changed from the base of the PR and between a810195 and 7b762a7.

📒 Files selected for processing (1)
  • languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt (9 hunks)

Walkthrough

This pull request includes updates to the Portuguese part-of-speech dictionary and spelling files. The added.txt file has seen numerous modifications, including the addition, removal, and modification of vocabulary entries, with new words and their respective tags being introduced. The spelling.txt file underwent significant changes as well, featuring the removal of outdated terms and the addition of new vocabulary to enhance the diversity of the spelling resource. Both files reflect collaborative efforts to maintain and expand the language resources.

Changes

File Path Change Summary
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt - Added multiple new entries including "Airbus", "Linux", and "rockstar".
- Removed entries like "Ophiuchus" and "mirrorless".
- Added a comment for manual entry.
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt - Removed numerous entries including "diskette" and "écarté".
- Added new entries such as "acordéon" and "aftershave".
- Expanded vocabulary with new terms at the end.

Possibly related PRs

Suggested reviewers

  • jaumeortola
  • p-goulart

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR. (Experiment)
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (3)
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt (1)

44-46: Consider standardizing comment formatting

The manual additions section header uses inconsistent comment formatting with extra '#' characters.

-######################################################
-######################################################
-# Add entries manually from here                     #
-######################################################
-######################################################
+# Manual Additions
+# Add entries manually below this line
🧰 Tools
🪛 LanguageTool

[duplication] ~45-~45: Possible typo: you repeated a word
Context: ...#### # # problematic entries, to review ad ad NCCSS00 ads ad NCCPS00 Airbus Airbus NC...

(ENGLISH_WORD_REPEAT_RULE)

languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt (2)

309-309: Consider rephrasing the comment more professionally.

The current comment provides good context but could be rephrased more professionally:

-# Added by Marco A.G.Pinto. Pedro told me to add here until he fixes the dictionary building.
+# Temporary additions by Marco A.G.Pinto pending dictionary building system update.
🧰 Tools
🪛 LanguageTool

[style] ~309-~309: Consider using the more polite verb “ask” (“tell” implies ordering/instructing someone).
Context: ...yod # Added by Marco A.G.Pinto. Pedro told me to add here until he fixes the dicti...

(TELL_ASK)


Alphabetical ordering issue detected in Portuguese word list

The word list is not properly alphabetized. The word "Balikó" should come after "baratinho" and "baratinhos" according to Portuguese alphabetical ordering rules, where lowercase letters should be sorted before uppercase letters.

  • Current order: autoevidências -> autoinflingir -> Balikó -> baratinho -> baratinhos
  • Expected order: autoevidências -> autoinflingir -> baratinho -> baratinhos -> Balikó
🔗 Analysis chain

Line range hint 310-409: LGTM! Well-structured Portuguese additions.

The new Portuguese words are well-formatted and include a good mix of terms. The alphabetical ordering appears correct.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Verify alphabetical ordering of Portuguese words
echo "Checking alphabetical ordering..."
sed -n '310,409p' languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt | sort -c || echo "Warning: Words are not in alphabetical order"

Length of output: 326


Script:

#!/bin/bash
# Let's get the actual words around 'Balikó' to see the specific ordering issue
echo "Words around 'Balikó':"
sed -n '310,409p' languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt | grep -B 2 -A 2 "Balikó"

Length of output: 263

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 84955d8 and a810195.

📒 Files selected for processing (2)
  • languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt (10 hunks)
  • languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt (9 hunks)
🧰 Additional context used
🪛 LanguageTool
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt

[style] ~309-~309: Consider using the more polite verb “ask” (“tell” implies ordering/instructing someone).
Context: ...yod # Added by Marco A.G.Pinto. Pedro told me to add here until he fixes the dicti...

(TELL_ASK)

languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt

[duplication] ~12-~12: Possible typo: you repeated a word
Context: ...iverso DI0MP0 diversos diverso PI0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~13-~13: Possible typo: you repeated a word
Context: ...sos diverso PI0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima mesmíssimo D...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~14-~14: Possible typo: you repeated a word
Context: ...I0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima mesmíssimo DD0FS0 mesmíss...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~17-~17: Possible typo: you repeated a word
Context: ...mo DD0FS0 mesmíssimas mesmíssimo DD0FP0 mesmíssimo mesmíssimo DD0MS0 mesmíssimos mesmíssimo DD0MP0 OK...

(ENGLISH_WORD_REPEAT_RULE)


[uncategorized] ~19-~19: Consider adding a comma between these interjections.
Context: ...mo DD0MS0 mesmíssimos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquí...

(OK_OK_COMMA)


[uncategorized] ~20-~20: Consider adding a comma between these interjections.
Context: ... mesmíssimos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquíssimo DI0...

(OK_OK_COMMA)


[uncategorized] ~21-~21: Consider adding a comma between these interjections.
Context: ...mos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquíssimo DI0FS0 pouqu...

(OK_OK_COMMA)


[duplication] ~45-~45: Possible typo: you repeated a word
Context: ...#### # # problematic entries, to review ad ad NCCSS00 ads ad NCCPS00 Airbus Airbus NC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~47-~47: Possible typo: you repeated a word
Context: ... to review ad ad NCCSS00 ads ad NCCPS00 Airbus Airbus NCCNO00 Alpha Alpha AQ0CS0 Alpha Alpha ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~48-~48: Possible typo: you repeated a word
Context: ...00 ads ad NCCPS00 Airbus Airbus NCCNO00 Alpha Alpha AQ0CS0 Alpha Alpha NCCS000 Alphas Alpha...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~61-~61: Possible typo: you repeated a word
Context: ...ottom-up bottom-up NCCN000 BT BT AQ0CN0 bull bull NCMS000 C2 C2 NCMN000 card card NCMS000...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~84-~84: Possible typo: you repeated a word
Context: ...d NCFP000 desenrolar desenrolar NCMN000 direct direct NCMS000 directs directs NCMP000 Ed Ed N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~85-~85: Possible typo: you repeated a word
Context: ...esenrolar NCMN000 direct direct NCMS000 directs directs NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~86-~86: Possible typo: you repeated a word
Context: ... direct NCMS000 directs directs NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 ep...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~88-~88: Possible typo: you repeated a word
Context: ...ts NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 epóxi epóxi AQ0CN0 extras extra...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~89-~89: Possible typo: you repeated a word
Context: ...Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 epóxi epóxi AQ0CN0 extras extra AQ0CP0 flag flag NC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~91-~91: Possible typo: you repeated a word
Context: ... epóxi epóxi AQ0CN0 extras extra AQ0CP0 flag flag NCFS000 flags flag NCFP000 footnote foo...

(ENGLISH_WORD_REPEAT_RULE)


[grammar] ~92-~92: You’ve repeated a verb. Did you mean to only write one of them?
Context: ...0 extras extra AQ0CP0 flag flag NCFS000 flags flag NCFP000 footnote footnote NCFS000 footn...

(REPEATED_VERBS)


[duplication] ~93-~93: Possible typo: you repeated a word
Context: ...P0 flag flag NCFS000 flags flag NCFP000 footnote footnote NCFS000 footnotes footnote NCFP000 frie...

(ENGLISH_WORD_REPEAT_RULE)


[grammar] ~94-~94: You’ve repeated a verb. Did you mean to only write one of them?
Context: ... flag NCFP000 footnote footnote NCFS000 footnotes footnote NCFP000 friendly friendly AQ0CS0 frontm...

(REPEATED_VERBS)


[duplication] ~95-~95: Possible typo: you repeated a word
Context: ...note NCFS000 footnotes footnote NCFP000 friendly friendly AQ0CS0 frontman frontman NCMS000 FSR FS...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~96-~96: Possible typo: you repeated a word
Context: ...otnote NCFP000 friendly friendly AQ0CS0 frontman frontman NCMS000 FSR FSR AQ0CN0 gap gap NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~97-~97: Possible typo: you repeated a word
Context: ...iendly AQ0CS0 frontman frontman NCMS000 FSR FSR AQ0CN0 gap gap NCMS000 gap gap NCCS00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~109-~109: Possible typo: you repeated a word
Context: ...ghts height NCFP000 hidro hidro NCFS000 idle idle NCMN000 iFood iFood AQ0CN0 layer layer ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~110-~110: Possible typo: you repeated a word
Context: ...0 hidro hidro NCFS000 idle idle NCMN000 iFood iFood AQ0CN0 layer layer NCFS000 layers layer...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~111-~111: Possible typo: you repeated a word
Context: ...00 idle idle NCMN000 iFood iFood AQ0CN0 layer layer NCFS000 layers layer NCFP000 Linux Linu...

(ENGLISH_WORD_REPEAT_RULE)


[grammar] ~112-~112: You’ve repeated a verb. Did you mean to only write one of them?
Context: ... iFood iFood AQ0CN0 layer layer NCFS000 layers layer NCFP000 Linux Linux AQ0CN0 lossless los...

(REPEATED_VERBS)


[duplication] ~113-~113: Possible typo: you repeated a word
Context: ...ayer layer NCFS000 layers layer NCFP000 Linux Linux AQ0CN0 lossless lossless NCCN000 Marghe...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~114-~114: Possible typo: you repeated a word
Context: ...layers layer NCFP000 Linux Linux AQ0CN0 lossless lossless NCCN000 Margherita Margherita AQ0FS0 Ma...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~125-~125: Possible typo: you repeated a word
Context: ...MasterCard AQ0CN0 média média:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~126-~126: Possible typo: you repeated a word
Context: ...:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCCP000 Ophiuchus Op...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~128-~128: Possible typo: you repeated a word
Context: ...N0 oper oper NCCS000 opers oper NCCP000 Ophiuchus Ophiuchus AQ0CN0 performer performer NCCFS00 pet ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~129-~129: Possible typo: you repeated a word
Context: ...oper NCCP000 Ophiuchus Ophiuchus AQ0CN0 performer performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~130-~130: Possible typo: you repeated a word
Context: ...chus AQ0CN0 performer performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~131-~131: Possible typo: you repeated a word
Context: ...former performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~132-~132: Possible typo: you repeated a word
Context: ... NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~133-~133: Possible typo: you repeated a word
Context: ...Q0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~134-~134: Possible typo: you repeated a word
Context: ...CMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte Play...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~135-~135: Possible typo: you repeated a word
Context: ... NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~136-~136: Possible typo: you repeated a word
Context: ...Q0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular p...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~137-~137: Possible typo: you repeated a word
Context: ...y Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular popular NCCS000 prime ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~138-~138: Possible typo: you repeated a word
Context: ...e PlayArte AQ0CN0 podere podere NCMSV00 popular popular NCCS000 prime prime AQ0CN0 Profa Profa ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~139-~139: Possible typo: you repeated a word
Context: ...podere NCMSV00 popular popular NCCS000 prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~140-~140: Possible typo: you repeated a word
Context: ...lar popular NCCS000 prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~141-~141: Possible typo: you repeated a word
Context: ... prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho que...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~142-~142: Possible typo: you repeated a word
Context: ...Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho quentinho NCMS000 que...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~143-~143: Possible typo: you repeated a word
Context: ...0 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho quentinho NCMS000 quentinhos quentinho NCMP000 R$...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~145-~145: Possible typo: you repeated a word
Context: ...ho NCMS000 quentinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~146-~146: Possible typo: you repeated a word
Context: ...ntinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Reb...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~147-~147: Possible typo: you repeated a word
Context: ...nho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~148-~148: Possible typo: you repeated a word
Context: ... R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~149-~149: Possible typo: you repeated a word
Context: ... R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockst...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~150-~150: Possible typo: you repeated a word
Context: ...000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockstar NCCS000 rocks...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~151-~151: Possible typo: you repeated a word
Context: ...0 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockstar NCCS000 rockstars rockstar NCCP000 S S ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~153-~153: Possible typo: you repeated a word
Context: ...star NCCS000 rockstars rockstar NCCP000 S S AO0CN0 s s AO0CN0 Samsung Samsung AQ0CN...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~168-~168: Possible typo: you repeated a word
Context: ...a spalla NCFS000 spallas spalla NCFP000 Sr Sr NCMS000 Sra Sra NCFS000 Sras Sra NCFP00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~255-~255: Possible typo: you repeated a word
Context: ...ção NCFP000 Whirlpool Whirlpool NPFS000 Woodcock Woodcock NPMS000 WritingTool WritingTool NPMN000...

(ENGLISH_WORD_REPEAT_RULE)

🔇 Additional comments (3)
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt (1)

Line range hint 1-255: Overall changes look good

The additions and modifications maintain the required format and structure of the part-of-speech dictionary. The entries are properly formatted with the correct tab-separated fields (fullform baseform postags).

🧰 Tools
🪛 LanguageTool

[duplication] ~122-~122: Possible typo: you repeated a word
Context: ...arketplace NCMP000 Marvel Marvel AQ0CN0 Mastercard Mastercard AQ0CN0 MasterCard MasterCard AQ0CN0 méd...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~123-~123: Possible typo: you repeated a word
Context: ...vel AQ0CN0 Mastercard Mastercard AQ0CN0 MasterCard MasterCard AQ0CN0 média média:2 NCMP000 mirrorless...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~124-~124: Possible typo: you repeated a word
Context: ...ard AQ0CN0 MasterCard MasterCard AQ0CN0 média média:2 NCMP000 mirrorless mirrorless AQ0CN0 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~125-~125: Possible typo: you repeated a word
Context: ...MasterCard AQ0CN0 média média:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~126-~126: Possible typo: you repeated a word
Context: ...:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCCP000 Ophiuchus Op...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~128-~128: Possible typo: you repeated a word
Context: ...N0 oper oper NCCS000 opers oper NCCP000 Ophiuchus Ophiuchus AQ0CN0 performer performer NCCFS00 pet ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~129-~129: Possible typo: you repeated a word
Context: ...oper NCCP000 Ophiuchus Ophiuchus AQ0CN0 performer performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~130-~130: Possible typo: you repeated a word
Context: ...chus AQ0CN0 performer performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~131-~131: Possible typo: you repeated a word
Context: ...former performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~132-~132: Possible typo: you repeated a word
Context: ... NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~133-~133: Possible typo: you repeated a word
Context: ...Q0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~134-~134: Possible typo: you repeated a word
Context: ...CMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte Play...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~135-~135: Possible typo: you repeated a word
Context: ... NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~136-~136: Possible typo: you repeated a word
Context: ...Q0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular p...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~137-~137: Possible typo: you repeated a word
Context: ...y Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular popular NCCS000 prime ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~138-~138: Possible typo: you repeated a word
Context: ...e PlayArte AQ0CN0 podere podere NCMSV00 popular popular NCCS000 prime prime AQ0CN0 Profa Profa ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~139-~139: Possible typo: you repeated a word
Context: ...podere NCMSV00 popular popular NCCS000 prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~140-~140: Possible typo: you repeated a word
Context: ...lar popular NCCS000 prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~141-~141: Possible typo: you repeated a word
Context: ... prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho que...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~142-~142: Possible typo: you repeated a word
Context: ...Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho quentinho NCMS000 que...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~143-~143: Possible typo: you repeated a word
Context: ...0 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho quentinho NCMS000 quentinhos quentinho NCMP000 R$...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~145-~145: Possible typo: you repeated a word
Context: ...ho NCMS000 quentinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~146-~146: Possible typo: you repeated a word
Context: ...ntinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Reb...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~147-~147: Possible typo: you repeated a word
Context: ...nho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~148-~148: Possible typo: you repeated a word
Context: ... R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~149-~149: Possible typo: you repeated a word
Context: ... R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockst...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~150-~150: Possible typo: you repeated a word
Context: ...000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockstar NCCS000 rocks...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~151-~151: Possible typo: you repeated a word
Context: ...0 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockstar NCCS000 rockstars rockstar NCCP000 S S ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~153-~153: Possible typo: you repeated a word
Context: ...star NCCS000 rockstars rockstar NCCP000 S S AO0CN0 s s AO0CN0 Samsung Samsung AQ0CN...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~154-~154: Possible typo: you repeated a word
Context: ...0 rockstars rockstar NCCP000 S S AO0CN0 s s AO0CN0 Samsung Samsung AQ0CN0 Selectra ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~155-~155: Possible typo: you repeated a word
Context: ... rockstar NCCP000 S S AO0CN0 s s AO0CN0 Samsung Samsung AQ0CN0 Selectra Selectra NCCS000 shelte...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~156-~156: Possible typo: you repeated a word
Context: ...O0CN0 s s AO0CN0 Samsung Samsung AQ0CN0 Selectra Selectra NCCS000 shelter shelter NCMS000 shelter...

(ENGLISH_WORD_REPEAT_RULE)

languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt (2)

1-1: LGTM! Clear section header.

The comment effectively indicates that these are foreign words requiring review.


Line range hint 2-307: Consider enhancing foreign words organization and validation.

While the additions expand the dictionary coverage, consider these improvements:

  1. Group words by origin language (e.g., English, French) using subsection comments
  2. Verify common usage in Portuguese contexts for technical terms
  3. Consider adding usage frequency indicators

Example structure:

# English origin
aftershave
basketball
...

# French origin
acordéon
assemblage
...

performer performer NCCFS00
pet pet AQ0CN0
pet pet NCMS000
pets pets NCMP000
pix pix AQ0CN0
pix pix NCMS000
Play Play NCMS000
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Remove trailing spaces from entries

Several entries contain trailing spaces that should be removed for consistency:

  • Line 136: popular
  • Lines 190-197: Multiple entries including todinha , todinhas , todinho , todinhos

These trailing spaces could cause issues in text processing.

Also applies to: 190-197

🧰 Tools
🪛 LanguageTool

[duplication] ~136-~136: Possible typo: you repeated a word
Context: ...Q0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular p...

(ENGLISH_WORD_REPEAT_RULE)

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 4

🧹 Outside diff range and nitpick comments (3)
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt (3)

Line range hint 2-307: Consider recategorizing well-established terms

Some words in the "questionable foreign words" section appear to be well-established in Portuguese usage:

  1. Portuguese adaptations (e.g., "básquete")
  2. Technical terms (e.g., "server", "package", "overwrite")
  3. Widely used foreign terms (e.g., "ballet", "budget", "sport")

Consider moving these established terms to a separate section or the main dictionary.


Line range hint 310-409: Consider reorganizing special terms

The Portuguese words section contains mixed categories of terms. Consider organizing them into separate sections:

  1. Brand names and proper nouns (e.g., "Whirlpool", "Woodcock")
  2. Technical terms (e.g., "WritingTool")
  3. Regular Portuguese vocabulary

This would improve maintainability and make future reviews easier.


Words are not properly sorted alphabetically

Several words in the "Questionable foreign words" section are out of alphabetical order:

  • bâton > bazooka
  • béguin > bibelot
  • boîte > bold
  • compère > complot
  • couché > couchette
  • coupé > coupon
  • début > debye
  • démarche > dessert
  • détente > diskette
  • écarté > engagé
  • maître > majorette
  • mêlée > monokini
  • pâté > paul

The Portuguese words section appears to be properly sorted.

🔗 Analysis chain

Line range hint 1-409: Verify alphabetical ordering

Let's confirm that all words are properly sorted alphabetically within their sections.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check alphabetical ordering within each section

# Function to check if a section is sorted
check_section() {
    local section_name="$1"
    local start_pattern="$2"
    local end_pattern="$3"
    
    echo "Checking section: $section_name"
    sed -n "/$start_pattern/,/$end_pattern/p" languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt | 
    grep -v '^#' | # Remove comments
    grep -v '^$' | # Remove empty lines
    awk 'NR>1{if ($0 < prev) print "Out of order: " prev " > " $0} {prev=$0}'
}

# Check foreign words section
check_section "Foreign words" "^# Questionable foreign words" "^$"

# Check Portuguese words section
check_section "Portuguese words" "^# Added by Marco" "$"

Length of output: 1382

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 84955d8 and a810195.

📒 Files selected for processing (2)
  • languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt (10 hunks)
  • languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt (9 hunks)
🧰 Additional context used
🪛 LanguageTool
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt

[style] ~309-~309: Consider using the more polite verb “ask” (“tell” implies ordering/instructing someone).
Context: ...yod # Added by Marco A.G.Pinto. Pedro told me to add here until he fixes the dicti...

(TELL_ASK)

languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt

[duplication] ~12-~12: Possible typo: you repeated a word
Context: ...iverso DI0MP0 diversos diverso PI0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~13-~13: Possible typo: you repeated a word
Context: ...sos diverso PI0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima mesmíssimo D...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~14-~14: Possible typo: you repeated a word
Context: ...I0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima mesmíssimo DD0FS0 mesmíss...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~17-~17: Possible typo: you repeated a word
Context: ...mo DD0FS0 mesmíssimas mesmíssimo DD0FP0 mesmíssimo mesmíssimo DD0MS0 mesmíssimos mesmíssimo DD0MP0 OK...

(ENGLISH_WORD_REPEAT_RULE)


[uncategorized] ~19-~19: Consider adding a comma between these interjections.
Context: ...mo DD0MS0 mesmíssimos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquí...

(OK_OK_COMMA)


[uncategorized] ~20-~20: Consider adding a comma between these interjections.
Context: ... mesmíssimos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquíssimo DI0...

(OK_OK_COMMA)


[uncategorized] ~21-~21: Consider adding a comma between these interjections.
Context: ...mos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquíssimo DI0FS0 pouqu...

(OK_OK_COMMA)


[duplication] ~45-~45: Possible typo: you repeated a word
Context: ...#### # # problematic entries, to review ad ad NCCSS00 ads ad NCCPS00 Airbus Airbus NC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~47-~47: Possible typo: you repeated a word
Context: ... to review ad ad NCCSS00 ads ad NCCPS00 Airbus Airbus NCCNO00 Alpha Alpha AQ0CS0 Alpha Alpha ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~48-~48: Possible typo: you repeated a word
Context: ...00 ads ad NCCPS00 Airbus Airbus NCCNO00 Alpha Alpha AQ0CS0 Alpha Alpha NCCS000 Alphas Alpha...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~61-~61: Possible typo: you repeated a word
Context: ...ottom-up bottom-up NCCN000 BT BT AQ0CN0 bull bull NCMS000 C2 C2 NCMN000 card card NCMS000...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~84-~84: Possible typo: you repeated a word
Context: ...d NCFP000 desenrolar desenrolar NCMN000 direct direct NCMS000 directs directs NCMP000 Ed Ed N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~85-~85: Possible typo: you repeated a word
Context: ...esenrolar NCMN000 direct direct NCMS000 directs directs NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~86-~86: Possible typo: you repeated a word
Context: ... direct NCMS000 directs directs NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 ep...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~88-~88: Possible typo: you repeated a word
Context: ...ts NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 epóxi epóxi AQ0CN0 extras extra...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~89-~89: Possible typo: you repeated a word
Context: ...Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 epóxi epóxi AQ0CN0 extras extra AQ0CP0 flag flag NC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~91-~91: Possible typo: you repeated a word
Context: ... epóxi epóxi AQ0CN0 extras extra AQ0CP0 flag flag NCFS000 flags flag NCFP000 footnote foo...

(ENGLISH_WORD_REPEAT_RULE)


[grammar] ~92-~92: You’ve repeated a verb. Did you mean to only write one of them?
Context: ...0 extras extra AQ0CP0 flag flag NCFS000 flags flag NCFP000 footnote footnote NCFS000 footn...

(REPEATED_VERBS)


[duplication] ~93-~93: Possible typo: you repeated a word
Context: ...P0 flag flag NCFS000 flags flag NCFP000 footnote footnote NCFS000 footnotes footnote NCFP000 frie...

(ENGLISH_WORD_REPEAT_RULE)


[grammar] ~94-~94: You’ve repeated a verb. Did you mean to only write one of them?
Context: ... flag NCFP000 footnote footnote NCFS000 footnotes footnote NCFP000 friendly friendly AQ0CS0 frontm...

(REPEATED_VERBS)


[duplication] ~95-~95: Possible typo: you repeated a word
Context: ...note NCFS000 footnotes footnote NCFP000 friendly friendly AQ0CS0 frontman frontman NCMS000 FSR FS...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~96-~96: Possible typo: you repeated a word
Context: ...otnote NCFP000 friendly friendly AQ0CS0 frontman frontman NCMS000 FSR FSR AQ0CN0 gap gap NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~97-~97: Possible typo: you repeated a word
Context: ...iendly AQ0CS0 frontman frontman NCMS000 FSR FSR AQ0CN0 gap gap NCMS000 gap gap NCCS00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~109-~109: Possible typo: you repeated a word
Context: ...ghts height NCFP000 hidro hidro NCFS000 idle idle NCMN000 iFood iFood AQ0CN0 layer layer ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~110-~110: Possible typo: you repeated a word
Context: ...0 hidro hidro NCFS000 idle idle NCMN000 iFood iFood AQ0CN0 layer layer NCFS000 layers layer...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~111-~111: Possible typo: you repeated a word
Context: ...00 idle idle NCMN000 iFood iFood AQ0CN0 layer layer NCFS000 layers layer NCFP000 Linux Linu...

(ENGLISH_WORD_REPEAT_RULE)


[grammar] ~112-~112: You’ve repeated a verb. Did you mean to only write one of them?
Context: ... iFood iFood AQ0CN0 layer layer NCFS000 layers layer NCFP000 Linux Linux AQ0CN0 lossless los...

(REPEATED_VERBS)


[duplication] ~113-~113: Possible typo: you repeated a word
Context: ...ayer layer NCFS000 layers layer NCFP000 Linux Linux AQ0CN0 lossless lossless NCCN000 Marghe...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~114-~114: Possible typo: you repeated a word
Context: ...layers layer NCFP000 Linux Linux AQ0CN0 lossless lossless NCCN000 Margherita Margherita AQ0FS0 Ma...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~125-~125: Possible typo: you repeated a word
Context: ...MasterCard AQ0CN0 média média:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~126-~126: Possible typo: you repeated a word
Context: ...:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCCP000 Ophiuchus Op...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~128-~128: Possible typo: you repeated a word
Context: ...N0 oper oper NCCS000 opers oper NCCP000 Ophiuchus Ophiuchus AQ0CN0 performer performer NCCFS00 pet ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~129-~129: Possible typo: you repeated a word
Context: ...oper NCCP000 Ophiuchus Ophiuchus AQ0CN0 performer performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~130-~130: Possible typo: you repeated a word
Context: ...chus AQ0CN0 performer performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~131-~131: Possible typo: you repeated a word
Context: ...former performer NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~132-~132: Possible typo: you repeated a word
Context: ... NCCFS00 pet pet AQ0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~133-~133: Possible typo: you repeated a word
Context: ...Q0CN0 pet pet NCMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~134-~134: Possible typo: you repeated a word
Context: ...CMS000 pets pets NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte Play...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~135-~135: Possible typo: you repeated a word
Context: ... NCMP000 pix pix AQ0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~136-~136: Possible typo: you repeated a word
Context: ...Q0CN0 pix pix NCMS000 Play Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular p...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~137-~137: Possible typo: you repeated a word
Context: ...y Play NCMS000 PlayArte PlayArte AQ0CN0 podere podere NCMSV00 popular popular NCCS000 prime ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~138-~138: Possible typo: you repeated a word
Context: ...e PlayArte AQ0CN0 podere podere NCMSV00 popular popular NCCS000 prime prime AQ0CN0 Profa Profa ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~139-~139: Possible typo: you repeated a word
Context: ...podere NCMSV00 popular popular NCCS000 prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~140-~140: Possible typo: you repeated a word
Context: ...lar popular NCCS000 prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~141-~141: Possible typo: you repeated a word
Context: ... prime prime AQ0CN0 Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho que...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~142-~142: Possible typo: you repeated a word
Context: ...Profa Profa NCFS000 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho quentinho NCMS000 que...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~143-~143: Possible typo: you repeated a word
Context: ...0 PrtSc PrtSc NCCS000 Puma Puma NCCSO00 quentinho quentinho NCMS000 quentinhos quentinho NCMP000 R$...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~145-~145: Possible typo: you repeated a word
Context: ...ho NCMS000 quentinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~146-~146: Possible typo: you repeated a word
Context: ...ntinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Reb...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~147-~147: Possible typo: you repeated a word
Context: ...nho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~148-~148: Possible typo: you repeated a word
Context: ... R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~149-~149: Possible typo: you repeated a word
Context: ... R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockst...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~150-~150: Possible typo: you repeated a word
Context: ...000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockstar NCCS000 rocks...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~151-~151: Possible typo: you repeated a word
Context: ...0 Rebook Rebook NCCSO00 Red Red NCMN000 rockstar rockstar NCCS000 rockstars rockstar NCCP000 S S ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~153-~153: Possible typo: you repeated a word
Context: ...star NCCS000 rockstars rockstar NCCP000 S S AO0CN0 s s AO0CN0 Samsung Samsung AQ0CN...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~168-~168: Possible typo: you repeated a word
Context: ...a spalla NCFS000 spallas spalla NCFP000 Sr Sr NCMS000 Sra Sra NCFS000 Sras Sra NCFP00...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~255-~255: Possible typo: you repeated a word
Context: ...ção NCFP000 Whirlpool Whirlpool NPFS000 Woodcock Woodcock NPMS000 WritingTool WritingTool NPMN000...

(ENGLISH_WORD_REPEAT_RULE)

🔇 Additional comments (4)
languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt (2)

Line range hint 1-6: LGTM: File header and format

The file header correctly documents:

  • The purpose of the file
  • The source of the dictionary
  • The file encoding
  • The entry format
🧰 Tools
🪛 LanguageTool

[duplication] ~12-~12: Possible typo: you repeated a word
Context: ...iverso DI0MP0 diversos diverso PI0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~13-~13: Possible typo: you repeated a word
Context: ...sos diverso PI0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima mesmíssimo D...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~14-~14: Possible typo: you repeated a word
Context: ...I0MP000 embaixo embaixo RG idem idem RG mega mega RG mesmíssima mesmíssimo DD0FS0 mesmíss...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~17-~17: Possible typo: you repeated a word
Context: ...mo DD0FS0 mesmíssimas mesmíssimo DD0FP0 mesmíssimo mesmíssimo DD0MS0 mesmíssimos mesmíssimo DD0MP0 OK...

(ENGLISH_WORD_REPEAT_RULE)


[uncategorized] ~19-~19: Consider adding a comma between these interjections.
Context: ...mo DD0MS0 mesmíssimos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquí...

(OK_OK_COMMA)


[uncategorized] ~20-~20: Consider adding a comma between these interjections.
Context: ... mesmíssimos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquíssimo DI0...

(OK_OK_COMMA)


[uncategorized] ~21-~21: Consider adding a comma between these interjections.
Context: ...mos mesmíssimo DD0MP0 OK OK RG Ok ok RG ok ok RG pouquíssima pouquíssimo DI0FS0 pouqu...

(OK_OK_COMMA)


[duplication] ~24-~24: Possible typo: you repeated a word
Context: ... DI0FS0 pouquíssimas pouquíssimo DI0FP0 pouquíssimo pouquíssimo DD0MS0 pouquíssimo pouquíssimo PI0MS000...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~25-~25: Possible typo: you repeated a word
Context: ...o DI0FP0 pouquíssimo pouquíssimo DD0MS0 pouquíssimo pouquíssimo PI0MS000 pouquíssimos pouquíssimo DI0MP...

(ENGLISH_WORD_REPEAT_RULE)


Line range hint 214-257: Review manual additions section

The manual additions section includes several Brazilian Portuguese variants (e.g., 'sistêmico'/'sistémico'). This is good practice for supporting both European and Brazilian Portuguese. However, ensure that:

  1. All entries maintain consistent formatting
  2. Proper names are correctly tagged with NP prefix
  3. Compound words use consistent hyphenation
✅ Verification successful

Manual additions section verified - formatting and tagging are consistent

The verification shows:

  1. All proper names (starting with capital letters) are correctly tagged with appropriate prefixes:
    • Person names use NP prefix (e.g., "Jumarie NPMS000", "Tolstoy NPMSS00")
    • Brand names and other proper nouns use NC prefix (e.g., "Whirlpool NPFS000")
  2. Compound words show consistent hyphenation patterns:
    • All compound words maintain the same format between their entry and base form
    • Consistent part-of-speech tagging (NC prefix) for compound nouns
  3. The Brazilian/European Portuguese variants (e.g., 'sistêmico'/'sistémico') maintain consistent formatting and tagging
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Verify consistent tagging of proper names
rg "^[A-Z][a-zA-Z]+\s+[A-Z][a-zA-Z]+\s+(?!NP)" 

# Verify hyphenation consistency in compound words
rg "(\w+)-(\w+)\s+\1-\2\s+\w+"

Length of output: 568


Script:

#!/bin/bash
# Check proper name entries (starting with capital letter) and their tags
rg --pcre2 "^[A-Z][a-zA-Z]+\s+[A-Z][a-zA-Z]+\s+\w+" languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt

# Check compound words with hyphens
rg "-" languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/added.txt

Length of output: 1972

🧰 Tools
🪛 LanguageTool

[duplication] ~252-~252: Possible typo: you repeated a word
Context: ...Tolstoy NPMS000 Tolstoy Tolstoy NPMSS00 videogravação videogravação NCFS000 videogravações videogravação NC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~254-~254: Possible typo: you repeated a word
Context: ...00 videogravações videogravação NCFP000 Whirlpool Whirlpool NPFS000 Woodcock Woodcock NPMS000 Writi...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~255-~255: Possible typo: you repeated a word
Context: ...ção NCFP000 Whirlpool Whirlpool NPFS000 Woodcock Woodcock NPMS000 WritingTool WritingTool NPMN000...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~256-~256: Possible typo: you repeated a word
Context: ...lpool NPFS000 Woodcock Woodcock NPMS000 WritingTool WritingTool NPMN000

(ENGLISH_WORD_REPEAT_RULE)

languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/spelling.txt (2)

1-1: LGTM!

The header clearly indicates the purpose of this section.


309-309: Track temporary solution and improve comment style

  1. The comment indicates these additions are temporary "until he fixes the dictionary building." Consider:

    • Creating a tracking issue for the dictionary building fix
    • Setting a timeline for moving these words to their proper location
  2. Consider rephrasing the comment to be more formal:

-# Added by Marco A.G.Pinto. Pedro told me to add here until he fixes the dictionary building.
+# Temporary additions pending dictionary building system fix (Added by Marco A.G.Pinto)

Let's check if there's an existing issue tracking this:

🧰 Tools
🪛 LanguageTool

[style] ~309-~309: Consider using the more polite verb “ask” (“tell” implies ordering/instructing someone).
Context: ...yod # Added by Marco A.G.Pinto. Pedro told me to add here until he fixes the dicti...

(TELL_ASK)

Comment on lines 84 to 86
desenrolar desenrolar NCMN000
direct direct NCMS000
directs directs NCMP000
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Fix inconsistent verb entries

The entries for 'direct' and 'directs' appear to be incorrectly tagged as nouns (NC). Since these are English verbs, they should either be removed or properly categorized if they're valid Portuguese terms.

-direct	direct	NCMS000
-directs	directs	NCMP000
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
desenrolar desenrolar NCMN000
direct direct NCMS000
directs directs NCMP000
desenrolar desenrolar NCMN000
🧰 Tools
🪛 LanguageTool

[duplication] ~84-~84: Possible typo: you repeated a word
Context: ...d NCFP000 desenrolar desenrolar NCMN000 direct direct NCMS000 directs directs NCMP000 Ed Ed N...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~85-~85: Possible typo: you repeated a word
Context: ...esenrolar NCMN000 direct direct NCMS000 directs directs NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~86-~86: Possible typo: you repeated a word
Context: ... direct NCMS000 directs directs NCMP000 Ed Ed NCMSS00 Eds Ed NCMPS00 EF EF NCCS000 ep...

(ENGLISH_WORD_REPEAT_RULE)

Comment on lines +145 to 148
quentinhos quentinho NCMP000
R$ R$ NCCP000
R$ R$ NCCS000
R$ R$ NCMP000
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Consolidate duplicate currency symbol entries

The entries for 'R$' are duplicated with different number/gender tags. Consider consolidating these into a single entry with an appropriate tag for currency symbols.

-R$	R$	NCCP000
-R$	R$	NCCS000
-R$	R$	NCMP000
-R$	R$	NCMS000
+R$	R$	Zm
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
quentinhos quentinho NCMP000
R$ R$ NCCP000
R$ R$ NCCS000
R$ R$ NCMP000
quentinhos quentinho NCMP000
R$ R$ Zm
🧰 Tools
🪛 LanguageTool

[duplication] ~145-~145: Possible typo: you repeated a word
Context: ...ho NCMS000 quentinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ ...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~146-~146: Possible typo: you repeated a word
Context: ...ntinhos quentinho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Reb...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~147-~147: Possible typo: you repeated a word
Context: ...nho NCMP000 R$ R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCC...

(ENGLISH_WORD_REPEAT_RULE)


[duplication] ~148-~148: Possible typo: you repeated a word
Context: ... R$ NCCP000 R$ R$ NCCS000 R$ R$ NCMP000 R$ R$ NCMS000 Rebook Rebook NCCSO00 Red Red N...

(ENGLISH_WORD_REPEAT_RULE)

@@ -115,26 +122,35 @@
Marvel Marvel AQ0CN0
Mastercard Mastercard AQ0CN0
MasterCard MasterCard AQ0CN0
média média:2 NCMP000
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue

Fix malformed tag in média entry

The entry média média:2 NCMP000 contains a malformed base form with a :2 suffix, which doesn't follow the established format.

-média	média:2	NCMP000
+média	média	NCMP000
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
média média:2 NCMP000
média média NCMP000
🧰 Tools
🪛 LanguageTool

[duplication] ~125-~125: Possible typo: you repeated a word
Context: ...MasterCard AQ0CN0 média média:2 NCMP000 mirrorless mirrorless AQ0CN0 oper oper NCCS000 opers oper NCC...

(ENGLISH_WORD_REPEAT_RULE)

@@ -4,22 +4,22 @@
# This does not add words to the spell checker, see hunspell/spelling.txt for that.
# File Encoding: UTF-8
# Format: three tab-separated fields: fullform baseform postags

#
conosco comigo PP1CPO00
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codebase verification

Base form mapping for 'conosco' is incorrect

The entry conosco comigo PP1CPO00 is incorrect. From the entities file (misc.ent), we can see that 'conosco/connosco' and 'comigo' are different tonic pronouns. The tag PP1CPO00 indicates a first-person plural (CP) oblique (O) personal pronoun, while 'comigo' is first-person singular. The base form for 'conosco' should be 'nós' (the first-person plural pronoun) instead of 'comigo' (first-person singular).

🔗 Analysis chain

Verify the pronoun tag for 'conosco'

The entry conosco comigo PP1CPO00 maps 'conosco' to 'comigo' as its base form. This seems incorrect as they are different pronouns - 'conosco' means "with us" while 'comigo' means "with me".

Also applies to: 22-22


Let me gather more specific information about Portuguese pronouns in the dictionary.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Search for other pronoun entries to verify the correct base form pattern
rg -A 1 "PP1CPO00|PP1CSO00" 

Length of output: 4189


Script:

#!/bin/bash
# Search for entries with 'conosco', 'comigo', 'connosco' and similar pronouns
rg "conosco|comigo|connosco" languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/

# Also check the morphological tag documentation
cat languagetool-language-modules/pt/src/main/resources/org/languagetool/resource/pt/tagset.txt || true

Length of output: 5427

@marcoagpinto marcoagpinto merged commit b333640 into master Dec 13, 2024
5 checks passed
@marcoagpinto marcoagpinto deleted the lt_marcoagpinto_20241213_0628 branch December 13, 2024 07:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant