Skip to content

Update age normalization method in frequency based model evaluator #7

@xj2193

Description

@xj2193

The current normalization method for age is applied on the entire dataset before splitting up which might cause age information leakage for the future so it isn't aligned with the best practice. The method needs to be updated to make the evaluations fair across train/test/validation sets.

For frequency baseline models, we need to "STOP" normalizing age in the corresponding evaluators where we process the data for evaluation.

https://github.com/cumc-dbmi/cehr-bert/blob/8be39f18cfbfba0f3905110bdf6a2e0fa289ff08/evaluations/model_evaluators.py#L406

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingenhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions