• GingaNinga
    link
    fedilink
    10
    edit-2
    29 days ago

    correlation and causation. even useless stats comparing apples and oranges, the numbers generated are only as good as the study design and methods.

  • @aislopmukbang@sh.itjust.works
    link
    fedilink
    English
    829 days ago

    Blindly. People love to list them as evidence as if the numbers stand on their own. Reality is a person had some hand in assembling the numbers and there is no such thing as a bulletproof statistic. Good statistics ought to be scrutinized.

    • As a math guy, I hate when people say statistics is math. Like yeah, there are equations, and math plays a role, but the results so often speak more to the selection and interpretation choices made by the statistician than to any kind of mathematical rigor.

  • Fargeol
    link
    fedilink
    829 days ago

    By training an algorithm that will have an impact on said statistics. Not only the algorithm can cheat (see Goodhart’s law), but it can repeat biases that led to these statistics (like those law enforcement algorithms that became racists)

    • @Apytele@sh.itjust.worksOP
      link
      fedilink
      1229 days ago

      My favorite was the one they were training to detect cancer in imaging scans but they forgot to edit out the info stamp in the corner so it just started flagging all the scans from the cancer center!

        • @Ideonek@lemm.ee
          link
          fedilink
          229 days ago

          A similar case was with scans from “mobile scanners”. Since those are used on patients to sick to be transported, their cases were disproportionaly “malicious”. Model was effectively optimozed to detected if scaner was stationary or mobile.

    • @Ideonek@lemm.ee
      link
      fedilink
      329 days ago

      You are describing Google Ads right now. Algorithms are better and better in reaching to poeple that are already on the purchase patch. It’s like giving a restaurant flayers to people that are waiting for a weiter to show them a table.

      Aren’t our ads amazing? Look, almost everyone who saw them made the purchase!

      Analytics that ignores Goodharts law ruin everything. Movies, HR, Marketing (not much to ruin left, but you get the point), performancet review, recommendations…

  • @General_Effort@lemmy.world
    link
    fedilink
    629 days ago

    …as a drunken man uses lamp posts — for support rather than illumination.

    The question makes me remember Daryl Bem, a celebrated social psychologist. He published a much cited article called “Writing the Empirical Journal Article”. About 15 years ago, he used this advice to prove that humans can see into the future. His advice is probably still used to teach. That’s probably the worst thing you can do.

  • @untorquer@lemmy.world
    link
    fedilink
    628 days ago

    Averages. They’re almost always a bullshit flag if it’s tied to anything remotely political. If you’re not going to also give the standard deviation and skew then at least use median.

  • ℕ𝕖𝕞𝕠
    link
    fedilink
    429 days ago

    Well, to immanetize the eschaton. That’s the worst thing to do with statistics.

  • @Freshparsnip@lemm.ee
    link
    fedilink
    English
    329 days ago

    I once saw a reddit post where some busybody counted how many people with dogs walked by in an hour and multiplied that by 24 and assumed that was how many walked by in a day (as if it would be the same amount at all times of day)