• @[email protected]
    38 points · 5 months ago

    Well, that’s part of the thing. Web scraping isn’t something a site’s policies can really cover. Like, they could ban your IP or any accounts you have, but web scraping itself will always be acceptable. It’s why projects like NewPipe and Invidious don’t care about YouTube cease and desist letters.

    • @[email protected]
      3 points · 5 months ago


      • folkrav
        1 point · 5 months ago

        Parsing absolutely comes with a lot more overhead. Many websites lean heavily on JS interactivity nowadays, so depending on the site, you often don’t get the full content you’re looking for straight out of the HTML returned by your HTTP request.
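
A rough sketch of that point, using only Python's standard library. The page body, the `/api/videos` endpoint, and the names `RAW_HTML`, `ListItemCollector`, and `extract_items` are all made up for illustration; no real site is being described. The raw HTML contains an empty list that a script would fill in a browser, so a plain HTML parser finds nothing to extract.

```python
from html.parser import HTMLParser

# Hypothetical response body: the <ul> stays empty until the script runs in a browser.
RAW_HTML = """
<html><body>
  <ul id="videos"></ul>
  <script>
    fetch("/api/videos").then(r => r.json()).then(items => {
      for (const v of items) {
        const li = document.createElement("li");
        li.textContent = v.title;
        document.getElementById("videos").appendChild(li);
      }
    });
  </script>
</body></html>
"""

# Hypothetical markup as it might look after a browser has executed the script.
RENDERED_HTML = '<ul id="videos"><li>Intro</li><li>Part 2</li></ul>'


class ListItemCollector(HTMLParser):
    """Collects the text content of every <li> element."""

    def __init__(self):
        super().__init__()
        self.in_li = False
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self.in_li = True
            self.items.append("")

    def handle_endtag(self, tag):
        if tag == "li":
            self.in_li = False

    def handle_data(self, data):
        # Script text also arrives here, but it is ignored because in_li is False.
        if self.in_li:
            self.items[-1] += data


def extract_items(html):
    parser = ListItemCollector()
    parser.feed(html)
    return [text.strip() for text in parser.items]


static_items = extract_items(RAW_HTML)         # what a plain HTTP request would give you
rendered_items = extract_items(RENDERED_HTML)  # what a browser would end up showing
print(static_items, rendered_items)
```

Getting at the second result from a scraper usually means either running a headless browser or calling the underlying data endpoint directly, which is where the extra overhead comes from.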

      • @[email protected]
        1 point · 5 months ago

        In what way?

        HTML definitely carries more overhead than JSON if you only care about the data.
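
To make the comparison concrete, here is a minimal sketch that pulls the same two records out of a JSON body and out of an HTML table, again with only the standard library. Both payloads and the names `JSON_BODY`, `HTML_BODY`, `CellCollector`, `videos_from_json`, and `videos_from_html` are invented for the example.

```python
import json
from html.parser import HTMLParser

# Hypothetical payloads carrying the same data in two formats.
JSON_BODY = '{"videos": [{"title": "Intro", "views": 120}, {"title": "Part 2", "views": 87}]}'
HTML_BODY = """
<table id="videos">
  <tr><th>Title</th><th>Views</th></tr>
  <tr><td>Intro</td><td>120</td></tr>
  <tr><td>Part 2</td><td>87</td></tr>
</table>
"""


def videos_from_json(body):
    # One call turns the body into native structures; types are already preserved.
    return [(v["title"], v["views"]) for v in json.loads(body)["videos"]]


class CellCollector(HTMLParser):
    """Collects the text of each <td>, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self.in_td = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag == "td":
            self.in_td = True
            self.rows[-1].append("")

    def handle_endtag(self, tag):
        if tag == "td":
            self.in_td = False

    def handle_data(self, data):
        if self.in_td:
            self.rows[-1][-1] += data


def videos_from_html(body):
    parser = CellCollector()
    parser.feed(body)
    # Skip the header row (it has no <td> cells) and convert the view counts by hand.
    return [(r[0].strip(), int(r[1])) for r in parser.rows if len(r) == 2]


print(videos_from_json(JSON_BODY))
print(videos_from_html(HTML_BODY))
```

Both functions return the same list, but the HTML path needs a custom parser, knowledge of the page layout, and manual type conversion, and it breaks whenever the markup changes.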