HS
Langfuse
Created by HS on 11/14/2023 in #get-support
Extended scores
Hey @Marc, checking to see your thoughts on my last comment. Thanks.
3 replies
Langfuse
Created by HS on 11/14/2023 in #get-support
Extended scores
Thanks @Marc. So in the first case, let's say two sources add the same score, e.g. Human 1 and Human 2 both rate accuracy. Will there be a way to get the distribution? Also, can we add additional metadata to the score, for example which algorithm, version, or tool was used for the evaluation, and how long it took to run the algorithm or complete the human evaluation? This would further help in observing the behavior of evaluators (score types). Another feature could be a type for the score value, such as numeric or categorical, so that the tool can show the corresponding summary stats.
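To make the request concrete, here is a rough sketch of what such an extended score could look like. This is not Langfuse's API; `ExtendedScore`, `summarize`, and all field names (`value_type`, `source`, `metadata`, `duration_s`) are hypothetical, and the sample values are made up for illustration.

```python
from collections import Counter
from dataclasses import dataclass, field
from statistics import mean, median
from typing import Literal, Union


@dataclass
class ExtendedScore:
    """Hypothetical score record; not a Langfuse type."""
    name: str                                       # e.g. "accuracy"
    value: Union[float, str]                        # number or category label
    value_type: Literal["numeric", "categorical"]   # drives which stats apply
    source: str                                     # who or what produced the score
    metadata: dict = field(default_factory=dict)    # algorithm, version, duration, ...


def summarize(scores):
    """Group scores by name and return stats that fit each value type."""
    groups = {}
    for score in scores:
        groups.setdefault(score.name, []).append(score)

    summary = {}
    for name, group in groups.items():
        values = [s.value for s in group]
        # Assumes all scores sharing a name also share a value type.
        if group[0].value_type == "numeric":
            summary[name] = {
                "count": len(values),
                "mean": mean(values),
                "median": median(values),
                "min": min(values),
                "max": max(values),
            }
        else:
            summary[name] = {"count": len(values), "counts": dict(Counter(values))}
    return summary


# Illustrative data only: two humans rate the same score, one tool adds a category.
scores = [
    ExtendedScore("accuracy", 4.0, "numeric", "Human 1", {"duration_s": 42}),
    ExtendedScore("accuracy", 5.0, "numeric", "Human 2", {"duration_s": 55}),
    ExtendedScore("tone", "polite", "categorical", "classifier",
                  {"algorithm": "tone-classifier", "version": "1.2", "duration_s": 0.3}),
]
result = summarize(scores)
```

With the value type stored on each score, the tool can pick mean/median for numeric scores and frequency counts for categorical ones, and the metadata makes it possible to compare evaluators by tool, version, or time taken.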
3 replies