The developers of the Manjaro Linux distribution, built on the basis of Arch Linux and aimed at beginners, announced the beginning of testing a new service MDD (Manjaro Data Donor), designed to collect statistics about the system and send it to the external server of the project. The author of the MDD intended to enable telemetry by default (opt-out), but the decision has not yet been approved and, judging by the objections of some developers and users, it is likely that telemetry will be offered as an option requiring prior consent of the user (a request to enable telemetry is proposed to be added to the greeting interface after the first download).

The report includes data such as host name, kernel version, desktop component versions, detailed information about hardware and drivers involved, screen size and resolution information, network device MAC addresses, disk serial numbers, disk partition data, information about the number of running processes and installed packages, versions of basic packages such as systemd, gcc, bash and PipeWire.

The sent data is stored on the project server in the ClickHouse database and visualized using the Grafana platform. The IP addresses of users are not stored, and the hash from the /etc/machine-id file is used as the system identifier.

Аccording to the code https://github.com/manjaro/mdd/blob/master/mdd.py#L40 sends everything.

  • Destide@feddit.uk
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    It amazes me it’s still as popular as it is and still own goaling at least once a year.

  • ShittyBeatlesFCPres@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    Why do they need half that data for a derivative of a distro? Fuck off. I don’t care if someone collects the model number of my GPU or whatever but that sounds like personally identifiable tracking data, not basic “telemetry” data to set development priorities or whatever.

  • Majestic@lemmy.ml
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    data such as host name,

    Okay why do they need to know that? Why do they need to know if the computer is called “Melissa’s Laptop” or “Workstation 15, Internal security division”? Seems like this kind of data could if stolen be misused and it has minimal legitimate purpose IMO as anyone can put anything as host name and while in organizations it often corresponds to use it doesn’t have to for individuals. Someone could call their machine “Mack’s Porn Rig” and they only use it for doing banking and a little coding.

    kernel version, desktop component versions, detailed information about hardware and drivers involved, screen size and resolution information,

    This all seems legitimate enough, this would be helpful for understanding the hardware their users run on and targeting features or bug fixes.

    network device MAC addresses,

    Not great but there is an argument for it, they could just grab and send the first 3-4 octets which would give them the info they need on manufacturers without getting uniquely identifiable data that along with some of this other stuff is concerning for fingerprinting.

    disk serial numbers,

    Okay, what the fuck. Why do they need disk serial numbers? What possible use is there for that. Those are used for warranty claims and could be used as part of uniquely fingerprinting a computer and person. Not cool.

    disk partition data,

    This is vague enough. I guess one could choose to see this as just info about partitions in use say if there’s also an NTFS partition that looks like a Windows install that would be useful but on the other hand data encompassed within a partition could also nefariously be read as allowing them access to all your data. Partition layout, partition labels, and file systems used on disks available to the system would be a clearer way to put this and erase any doubt.

    information about the number of running processes and installed packages, versions of basic packages such as systemd, gcc, bash and PipeWire.

    All this is also fine just technical data stuff.

  • HornedMeatBeast@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    I tried Manjaro last year and I hated it.

    Something about the distro would lock up my PC, it would freeze from time to time.

    I disabled the standby/sleep function, but allowed my monitors to go into standby. But if I left my PC for an hour or two my screens would not wake up, different types and brands. I had so many issues with Manjaro and while speaking with a friend I told him I had moved over to Nobara but he was still on Manjaro. But then a few weeks later he mentioned he was running Nobara. Seems he also ditched it.

    • HiddenLayer555@lemmy.ml
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      5 months ago

      Ad firm money.

      Maybe I’m just cynical, but my first instinct when I see stuff like this is they have a secret contract with an advertiser and are selling this information.

  • notprogrammer@programming.dev
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    The report includes data such as host name, kernel version, desktop component versions, detailed information about hardware and drivers involved, screen size and resolution information, network device MAC addresses, disk serial numbers, disk partition data, information about the number of running processes and installed packages, versions of basic packages such as systemd, gcc, bash and PipeWire.

    That’s insane

    • Buffalox@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      The MAC address is anonymized with sha256, and IP adresses aren’t stored.
      So this seems to me to be perfectly anonymous.

      • gnuhaut@lemmy.ml
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        MAC addresses are 48 bit, and half of that is just the manufacturer. So 24 bits really, and those bits aren’t random, I think manufacturers just assign these based on some scheme, like a serial number. Point is you could easily reverse the SHA by brute force.

        You can’t calculate any useful statistic from a hash so literally the only use this would have is some sort of tracking.

        • Buffalox@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          5 months ago

          this would have is some sort of tracking.

          It’s right at the top of the announcement, that it’s mainly for more accurate stats on unique users.
          It’s not that I think this is a good idea, because I don’t, but some people are blowing it out of proportions. Especially since this isn’t at all decided. Which I seriously doubt it will.

          • gnuhaut@lemmy.ml
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            5 months ago

            You don’t need this to count unique users. You could just assign a random number on install or whatever. Or even more simply, just run the thing once per month, should be accurate enough. Do they expect the software to just randomly spam duplicate reports? Don’t write it that way.

            Best case they don’t care about collecting minimal data and don’t understand that hashed MACs are easily reversible. So incompetent fools with no sensitivity to privacy.

            Maybe this should be Manjaro’s tagline: Not purposely malicious, just grossly negligent and ignorant.

            • Buffalox@lemmy.world
              link
              fedilink
              arrow-up
              0
              ·
              5 months ago

              You could just assign a random number on install or whatever.

              Funny, I thought the exact same thing.

      • GolfNovemberUniform@lemmy.ml
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        Why collect such data though? And you can call some Big Tech telemetry completely anonymous too if you trust their explanations.

        • Buffalox@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          5 months ago

          You can see the code of what is send.
          I’m not aware that Google claims they collect data anonymously, on everything where you are logged in.
          So that’s a false equivalence.

    • Bezier@suppo.fi
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Thought it’s probably fine after reading the title, but this shit isn’t fine. What the fuck.

  • SavvyWolf@pawb.social
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    Why do they need information about the hostname? Is it really valuable for them to know how many systems are named daves-pc?

    • d0ntpan1c@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      I moved one of my computers to endeavor, but one is still on manjaro and the contrast is kinda hilarious. Manjaro machine always gets funky after updates, it struggles to deal with sleep and hibernation, and it feels slow even when its like 4x as powerful as my EndeavourOS machine.

  • bruhSoulz@lemmy.ml
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    Glad i said fuck it and went straight to actual arch when i wanted to try arch based. Literally like 9/10 times i hear manjaro brought up its not going to be in praise. Ffs lol