Source: https://front-end.social/@fox/110846484782705013

Text in the screenshot from Grammarly says:

We develop data sets to train our algorithms so that we can improve the services we provide to customers like you. We have devoted significant time and resources to developing methods to ensure that these data sets are anonymized and de-identified.

To develop these data sets, we sample snippets of text at random, disassociate them from a user’s account, and then use a variety of different methods to strip the text of identifying information (such as identifiers, contact details, addresses, etc.). Only then do we use the snippets to train our algorithms-and the original text is deleted. In other words, we don’t store any text in a manner that can be associated with your account or used to identify you or anyone else.

We currently offer a feature that permits customers to opt out of this use for Grammarly Business teams of 500 users or more. Please let me know if you might be interested in a license of this size, and I’II forward your request to the corresponding team.

  • @[email protected]
    link
    fedilink
    111 months ago

    there’s a lot in this comment and i don’t want to engage with every little thing in there so lets focus in on what we are actually talking about: is AI theft?

    the argument i’ve made multiple times is not that the technology as a concept can only be used to steal, but that the people developing, using and monetizing it are stealing.

    I made sure to exclude individual users from this framing because the understanding that allows us to consider AI (not as a kind of matrix math, but as a technology primarily mobilized by huge incredibly wealthy companies with monetary interest in its deployment and use) as theft is unconcerned with individual users changing their relationship to creative technology and its output.

    to restate that argument and get us back on track: AI (once again, not the matrix math, but the set of interconnected technologies built around a proprietary form of it that are being mobilized for profit by enormous incredibly powerful companies) is theft because the people in control of it steal. the closed technology of AI is predicated on open and free access to data for training without any pretense of compensation made to the creators of that data, the maintainers of resources like apis, servers and databases used to access that data or any expectation that the end model will ever be made public. the technology of AI is intended to be used to generate profit, and as such those in control of it have every motive to avoid compensating the people who created, hosted, cataloged and made available the training data or releasing the model to the public. it represents a leverage of public goods for private gain in very explicit terms and this understanding is even part of the pitch to investors.

    Now lets look at the effects that AI (once again, the interlocking set of technologies used by google et al. not the concept of matrix mathematics) has had just in the past year because of this relationship to public data. hosts of historic information acting as public goods have incurred significant costs, sources of information once kept open have beed gated to prevent crawling and access to “content” previously viewed as simply fodder for ad draws has been restricted because of it’s new status as training data.

    That last one should have particular resonance given that reddit planned to monetize peoples posts and comments as training data to sweeten their IPO.

    now we need to talk about the word theft because we both recognize that its’ particular legal meaning isn’t really applicable to this situation and that western legal verbiage has actually avoided enclosure in general. it would be very easy to simply take a pedantic reading of the word theft and then say “it’s not theft for that reason!”, but neither you nor I will do that because we are interested in discussing this new technology and the developments around it. in lieu of saying “monetizing the commons and taking actions that lead to their restriction” every time we need to talk about whats happening, we need a word that means roughly that.

    historically, we can see many examples of the word “theft” used in that way, all the way back to the enclosures but even farther if we like. so because of it’s historical use and broad meaning in the vernacular we can choose theft as the word to mean “monetizing the commons and taking actions that lead to their restriction”.

    Now that i’ve restated the argument and defined the terms, lets take a look at your claim that AI is no different than any other technology using your own examples:

    The printing press did not monetize the commons or restrict them. an argument could be made that its widespread use lowered the barrier to entry for massive amounts of human knowledge and literacy in general, so it actually had the opposite effect. The automobile did act as an enabler for the monetization and enclosure of the commons through the development of the highway system and jaywalking laws, restricting roadways to those who could afford cars. so that could be a good example, but you don’t invoke it, choosing instead to reference its effect on leather-working industries. word processors, either as physical machines or software programs do not monetize or enclose the commons on their own, however a person could make the (i think weak) argument that proprietary file formats are a form of enclosure. You don’t though instead claiming that the word processor put secretaries out of work.

    so, it seems like AI (hopefully for the last time: as a set of technologies leveraged to control and monetize public goods, not matrix mathematics) is significantly different than past technological advances but very similar to some events.

    • arglebargle
      link
      fedilink
      English
      111 months ago

      I stand by what I said. Your first paragraph said it too: AI is not theft. It just is math. The problem is not the technology: it is the corporate interests. So AI itself is not theft.

      Buying all the local land in Mexico that previously were surf spots/fishing villages/local hangouts/town squares, and converting them to monoculture farms for export is a similar idea where corporations leverage government to the detriment of people and environment. But we cant blame the technology, its the rampant corportization.

      So we are not that far apart, but the issue is not math. It is how it is used.

      • @[email protected]
        link
        fedilink
        111 months ago

        Hey I’m not gonna do two comment threads with you when you can’t even stick to the one we have going. If you wanna stick with this one, reply to this comment and let the other one go. I’ll post this preface in my response to your other comment too.

        I have held that AI (parenthetical explanation) is theft. You have tried over and over again to put words in my mouth, please stop. I’m trying hard to engage in good faith with you.

        In your example the technology of manipulating all levels of government to gain control of massive tracts of land against the wishes of owners, residents and users of the area (and to the detriment of nature) is theft. Not in a legal sense, because the legal system has no interest in upholding the rights of those left out of that process, but in a common use sense.

        That technology could be used by individuals and perhaps not be called theft but it’s not. It’s always deployed to the benefit of capital.

        Do some reading on the luddites. They’re very interesting!

    • arglebargle
      link
      fedilink
      English
      111 months ago

      By the way theft implies something taken. I responded by saying tech takes away jobs all the time. You suggested that providing services based on publicly available data is some how stealing is absurd. Nothing was taken away. So I tried to relate it to jobs. You said the commons are effected. Yet they are not, as nothing it taken away. You said that coroporations are the bad ones, but local instances are not - so I can run my own instances and do not use the corporate ones, so what was taken?

      I just do not understand this as theft. It is absurd.

      • @[email protected]
        link
        fedilink
        111 months ago

        Hey I’m not gonna do two comment threads with you when you wont even stick to the one we have going. If you wanna stick with this one, reply to this comment and let the other one go. I’ll post this preface in my response to your other comment too.

        I explained in detail how AI (in almost every case parenthetically explaining like this that I’m not talking about matrix mathematics, but a complex set of interlocking technologies mobilized by powerful companies to profit off of public work) affects our commons.

        I also explained how the formal western legal holding of the word theft isn’t useful in this case, but in lieu of a word that has more pertinence, the widely used vernacular of theft is perfectly fine.

        Now I need you to recognize that in light of those explanations, being precious about the words particular meaning instead of talking about the actual problem at hand amounts to deploying pedantry to avoid weighing in on the dispute.

        So maybe let’s start from the start: what do you think should be done in light of the effect that AI (long explanation that I’m not talking about matrix mathematics here) is having?