Indexing causes "Data too long for column 'word'" #247

Closed
drn05r opened this issue Sep 13, 2022 · 1 comment

@drn05r
Contributor

drn05r commented Sep 13, 2022

This is similar to #190 but for regular rather than Xapian indexing. The cause is that the word field in eprint__rindex is a VARCHAR(128). That limit is not unreasonable, as it is unlikely that a word that might be searched on would be longer than 128 characters. The issue is not due to multi-byte words; it is due to MariaDB 10.3+ raising an error when a word longer than 128 characters is inserted into eprint__rindex, rather than just truncating it. Neither truncating nor raising an error is appropriate. It is unlikely to be useful to index a word longer than 128 characters even if it is possible, so such words should be excluded from indexing.
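
The exclusion described above can be sketched roughly as follows. This is an illustration in Python only, not the EPrints code (EPrints is written in Perl). The 128-character limit comes from the VARCHAR(128) column; the names `MAX_WORD_LEN` and `indexable_words` are hypothetical.

```python
# Hypothetical constant mirroring the VARCHAR(128) word column in eprint__rindex.
MAX_WORD_LEN = 128


def indexable_words(words):
    """Return only the words that fit in the column.

    Words that are too long are dropped entirely; they are neither
    truncated nor passed on to the database to fail on insert.
    """
    # len() counts characters, not bytes, so multi-byte words are
    # measured the same way as a VARCHAR character limit.
    return [w for w in words if len(w) <= MAX_WORD_LEN]
```

A word of exactly 128 characters is kept, while a 129-character word is skipped without affecting the other words in the same field.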

@drn05r drn05r added the bug Something isn't working label Sep 13, 2022
@drn05r drn05r added this to the 3.4.5 milestone Sep 13, 2022
@drn05r drn05r self-assigned this Sep 13, 2022
@drn05r drn05r closed this as completed in 01da0b5 Sep 13, 2022
@drn05r
Contributor Author

drn05r commented Apr 7, 2023

The original fix would allow short words and stop words to be indexed when previously they would not be. $ok should only be changed (from 1 to 0) if the word length is too long.
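
The corrected rule can be sketched as below, again in Python for illustration rather than as the actual EPrints Perl. The minimum length, the stop-word list, and the function name `should_index` are assumptions made up for the example; only the idea that the flag moves from 1 to 0, and never back, is taken from the comment above.

```python
MAX_WORD_LEN = 128  # from the VARCHAR(128) word column
MIN_WORD_LEN = 3    # hypothetical minimum word length
STOP_WORDS = frozenset({"the", "and", "for"})  # hypothetical stop-word list


def should_index(word):
    """Decide whether a word is indexed, mirroring the $ok flag."""
    # Earlier checks: short words and stop words are rejected.
    ok = len(word) >= MIN_WORD_LEN and word.lower() not in STOP_WORDS

    # Length check: may only turn ok off. Assigning the result of the
    # comparison directly (ok = len(word) <= MAX_WORD_LEN) would switch
    # ok back on for short words and stop words, which is the kind of
    # regression described above.
    if len(word) > MAX_WORD_LEN:
        ok = False

    return ok
```

With this shape, the over-length check can reject a word that passed the earlier checks, but it cannot rescue one that they already rejected.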

@drn05r drn05r reopened this Apr 7, 2023
drn05r added a commit that referenced this issue Apr 7, 2023
@drn05r drn05r closed this as completed Apr 12, 2023