Monochromatic scatterplots are misleading #59

@longouyang

Reproducing a Slack conversation:

[long] the problem is something like this
when you approximate a continuous distribution by sampling
all the samples have the same score
(if you do the aggregation where score = frequency of sampled value)
if you then visualize this, points will tend to overplot and it will be hard to see the real signal without intelligently setting opacity levels
we can get around this for base continuous distributions by just using the value from the primitive scorer
instead of the frequency score
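A minimal sketch of the problem described above, written in Python for illustration only. It assumes a standard Gaussian as the base continuous distribution; `gaussian_logpdf` is a hypothetical stand-in for the primitive scorer, not the library's actual function. Because continuous samples almost never repeat, every frequency-based score comes out as 1/n, while the density-based scores vary and can drive color or opacity.

```python
import math
import random
from collections import Counter


def gaussian_logpdf(x, mu=0.0, sigma=1.0):
    """Log density of a Gaussian (hypothetical stand-in for the primitive scorer)."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2 * sigma ** 2)


rng = random.Random(0)
samples = [rng.gauss(0.0, 1.0) for _ in range(1000)]

# Frequency-based aggregation: score = fraction of samples equal to this value.
counts = Counter(samples)
freq_scores = {x: c / len(samples) for x, c in counts.items()}

# Density-based scoring: evaluate the known density at each sampled value.
density_scores = {x: math.exp(gaussian_logpdf(x)) for x in counts}

print("distinct frequency scores:", len(set(freq_scores.values())))
print("distinct density scores:  ", len(set(density_scores.values())))
```

With frequency scores, every point gets the same color, which is the monochromatic plot the issue title warns about. This only works when the density is known in closed form, which is why it covers base distributions but not marginals.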

[ndg] ah!
makes sense

[long] but for continuous marginals, we’re hosed

[ndg] yeah.... at some point (maybe soon?) we'll add a kernel-based aggregator, which will help with this.
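A rough sketch of what a kernel-based aggregator could look like, again in Python and not taken from any existing implementation. It assumes a Gaussian kernel and a normal-reference rule-of-thumb bandwidth (1.06 * sigma * n^(-1/5)); the function name `gaussian_kde`, the bandwidth choice, and the example marginal (a sum of two uniforms) are all illustrative assumptions.

```python
import math
import random


def gaussian_kde(samples, bandwidth):
    """Return a function estimating the density at x from the samples."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density


rng = random.Random(0)
# Hypothetical continuous marginal with no primitive scorer available:
# the sum of two uniform draws (triangular on [0, 2], peaked at 1).
samples = [rng.random() + rng.random() for _ in range(500)]

# Rule-of-thumb bandwidth: 1.06 * sample std * n^(-1/5) (an assumption).
mean = sum(samples) / len(samples)
std = math.sqrt(sum((s - mean) ** 2 for s in samples) / (len(samples) - 1))
bandwidth = 1.06 * std * len(samples) ** (-1 / 5)

density = gaussian_kde(samples, bandwidth)

# Each sample now gets a smoothed score instead of a constant 1/n.
scores = [density(s) for s in samples]
print("score near the mode:", density(1.0))
print("score in the tail:  ", density(0.05))
```

The estimated scores are higher near the mode than in the tails, so a scatterplot colored by them would no longer be monochromatic. The cost is O(n) per evaluated point, and the result depends on the bandwidth.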

[long] ah, good to know

[robert_hawkins] that’ll be great

[long] at the moment, i haven’t finished fixing the problem for even base continuous distributions yet. hence the advice to distrust monochromatic scatter plots
