Skip to content

Update tokenizer.py#137

Open
mingchen-li wants to merge 1 commit into
c2nes:masterfrom
mingchen-li:tokenize-return-index
Open

Update tokenizer.py#137
mingchen-li wants to merge 1 commit into
c2nes:masterfrom
mingchen-li:tokenize-return-index

Conversation

@mingchen-li

Copy link
Copy Markdown

The tokenize function can return index now. It would be helpful for developing https://huggingface.co/docs/tokenizers/api/pre-tokenizers for Java.

The tokenize function can return index now
LoRexxar added a commit to LoRexxar/javalang that referenced this pull request May 22, 2026
c2nes#137)

Feature 实现:
- c2nes#114/c2nes#41/c2nes#26: 保留子表达式中 prefix/postfix operators
- c2nes#88: Java 8 receiver parameter 支持 (Inner.this 参数)
- c2nes#133: 添加 JavaVisitor 类 (visitor.py)
- c2nes#100: Position namedtuple 添加 range 字段 (slice)
- c2nes#137: tokenize() 添加 return_index 参数

新增文件:
- javalang/visitor.py: Visitor pattern 实现
- javalang/test/test_upstream_features.py: 15 个 feature 测试

测试: 112/112 全部通过
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant