• ABC123itsEASY@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    3 days ago

    I some beef with this meme in that there really isn’t a way to simply do this in windows. If anything, it demonstrates the upper level of capability and function using a cli shell. People who are looking for a windows replacement would never need to understand this command or even use a pipe / regex as they were unlikely to have been doing this kind of thing with windows anyway.

      • ABC123itsEASY@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        3 days ago

        I could imagine a command like this being used as part of a CI/CD script doing static analysis in a virtualized environment where the build is running in a *nix container. There’s more maintainable options as well (ie easier for an entire team of developers to understand / lower ‘bus factor’).

        • Justin@lemmy.jlh.name
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          2 days ago

          Build scripts are often written in bash, yes, but I would say that you should find a utility program, or write your own utility in python, if you’re breaking out sed. It’s very hard to read code like this, no matter the team size.

          There’s probably only 100-300 usages of sed in the entire nixpkgs repo, with over 100,000 packages.

          I definitely agree Linux is easier to maintain and build code on than Windows, but yeah abusing sed is not really an ideal use case 😅

  • 👍Maximum Derek👍@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    0
    ·
    3 days ago

    The first language I was fluent in was Perl so PCRE is second nature to me. But then everyone decided they wanted their own regex dialects. And now there’s a PCRE2? Why 2? Stay with 1, you’re good together. What about the kids?

  • mogoh@lemmy.ml
    link
    fedilink
    arrow-up
    0
    ·
    3 days ago

    Which one of these commands is correct?

    A: sed -E 's/\b(\w+)\b/echo \1 | rev/g' file.txt
    B: sed 's/\b\w+\b/echo & | rev/ge' file.txt
    C: sed -E 's/(\w+)/$(echo \1 | rev)/g' file.txt
    D: sed 's/\([a-zA-Z]\+\)/\n&\n/g; s/\n\(.*\)\n/\3\2\1/g; s/\n//g' file.txt

    Chatty was so kind to transcribe. May contain errors.

    • mogoh@lemmy.ml
      link
      fedilink
      arrow-up
      0
      ·
      3 days ago

      Chatty claims the correct answer to be:

      Spoiler

      B

      I tried it my self and I conclude:

      Spoiler

      none is correct.

      • UltraBlack@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        2 days ago

        Thought so lol

        A: didn’t even try what by does B: Single quotes prevent execution C: there is no way to execute commands afaik so this won’t work either D: that syntax is just wrong afaik

    • Onno (VK6FLAB)@lemmy.radio
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      3 days ago

      Google Lens says:

      Which one of these commands is correct?
      
      A sed -e 's/\b(\w+)\b/echo \1 | rev/g' file.txt
      
      B: sed 's/b\w+\b/echo & | rev/ge' file.txt
      
      Csed -e 's/(\w+)/$(echo \1 | rev)/g' file.txt
      
      D: sed 's/([a-zA-Z]\+\)/\n&\n/g; s/\n\(\)\(.*\)\(\)\n/\3\2\1/g; s/\n//g' file.tx
      

      It’s interesting that Google doesn’t even get all the text. I had to manually extend the selection and that still misses the “t” on the end of answer D, munches C and more alarmingly changes the case for “-E”.

        • morrowind@lemmy.ml
          link
          fedilink
          arrow-up
          0
          ·
          2 days ago

          OCR was AI.

          Anyway today’s models are measurably better especially when you go beyond simple text on a clean page.

        • vivendi@programming.dev
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          Any good OCR model also uses “AI”

          And LLMs are usually really good at detecting text

          Source: Had to OCR a quite a few ancient university papers

  • foggy@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    edit-2
    3 days ago

    Yo ill be 100 wjth you.

    Regex is where something kike an LLM excells.

    Don’t rely on an llm for coding, but… This is exactly where it should be in your toolbox.

    • ABC123itsEASY@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      3 days ago

      Lol why are you getting downvoted this isn’t even a hot take. You are 100% right regex is famously enigmatic even among experienced software engineers.

      • foggy@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        edit-2
        3 days ago

        Yeah Lemmy used to have a core of tech Intel and that has slipped hard in the last 6 months.

        Be what it do I guess. Dummies gonna dumb.

        We are in this sea of like a million people who want to be cybersecurity professionals…

        …and as a cybersecurity professional it’s adorable when I see vehement dissent.

        Like y’all, I’ve been doing this. And if you want a recommendation, pipe down lol.

        • ABC123itsEASY@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          2 days ago

          Yea I come from the generation of reddit departures that left because of API lockdown and elimination of third party apps. Nowadays a lot of people join Lemmy because they got banned off of reddit for reasons of varying respectability. I would say it’s diluting the concentration of tech intel, as you say. Oh well.

          • foggy@lemmy.world
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            2 days ago

            Lol yep. Also here from the reason for which you only care about a lot if you have done some kind of web develooment.

            Edit: Jesus I just reread that. I literally just ripped the bong. Was a dumb sentence. I’ll leave it.

      • foggy@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        edit-2
        3 days ago

        A lot of lemmy is very anti-Ai. As an artist I’m very anti-Ai. As a veteran developer I’m very pro AI (with important caveats). I see it’s value; I see it’s threat.

        I know I’m not in good company when I talk about its value on Lemmy.

        • Natanox@discuss.tchncs.deOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          Completely with you on this one. It’s awful when used to generate “art”, but once you’ve learned its short-comings and never blindly trust it it is such a phenomenal help in learning and assisting with code or finding something you’ve a hard time to find the right words for. And aside from generative use-cases neural networks are also phenomenally useful for assisting tasks in science, medicine and so on.

          It’s just unfortunate we’re still in the “find out” phase of the information age. It’s like with the industrialization ~200 years ago, just with data… and unfortunately the lessons seem to be equally rough. All the generative tech will deal painful blows to our culture.

          • JayDee@lemmy.sdf.org
            link
            fedilink
            arrow-up
            0
            ·
            3 days ago

            That’s a view from the perspective of utility, yeah. The downvotes here are likely also from a ethics standpoint, since most LLMs currently trained are doing so by using other peoples’ work without permission, all while using large amounts of water for cooling, and energy from our mostly coal-powered grid. This is also not mentioning the physical and emotional labor that many untrained workers are required to do when sifting through the datasets of these LLMs, removing unsavory data for extremely low wages.

            A smaller, more specialized LLM could likely perform this same functionality with a much less training, on a more exclusive data set (probably only a couple of terabytes at its largest I’d wager), and would likely be small enough to run on most users’ computers after training. That’d be the more ethical version of this use case.

    • circuitfarmer@lemmy.sdf.org
      link
      fedilink
      arrow-up
      0
      ·
      3 days ago

      I don’t disagree with this hot take. But the major difference is the sheer resources needed to have an LLM in place of a “do one thing right” utility like sed. In that sense, they are incomparable.

      • foggy@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        edit-2
        3 days ago

        I mean fair.

        I guess the caveat here should be fucking learn regex first, lamo.

        Don’t use it works not necessary. Google is probably still better if you’re looking for regex for an email or something like that

        And also don’t just rely on its answer for prod.

      • bus_factor@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        3 days ago

        I think they’re arguing for having the LLM generate the regex. And I certainly would not trust an LLM to do that right.

        • Natanox@discuss.tchncs.deOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          Yeah, it’s way more sensible to use some of the available regex utilities like this. Although it’s always funny to see what an LLM comes up with.

    • bus_factor@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      3 days ago

      I don’t see anything wrong with the capture groups in A and C. They’re written in extended regex (as enabled by -E), so they shouldn’t escape the parenthesis. Am I missing something?