Skip to content

Search improvements with punctuation in Chinese #3229

@Guybrush88

Description

@Guybrush88

As reported by LeviHighway on the wall:

Should punctuation marks be counted in the character count? In Chinese, for example, in Word, each punctuation mark counts as one character (and when we wrote essays as students, punctuation was also included in the character count). On Tatoeba, the Chinese enumeration comma (、) and period (。) are counted as one character, but question marks (?), exclamation marks (!), colons (:), and semicolons (;) are not counted. I think the Chinese counting standard should be improved.

Additionally, since search engines do not ignore Chinese periods, when searching for "瑪莉"$, only results like "……瑪莉?" can be obtained, while "……瑪莉。" cannot be found. This should be considered a bug.

https://tatoeba.org/it/wall/show_message/41506#!#message_41506

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementIssue that describes a problem that requires a change in the current functionalities of Tatoeba.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions