As the tech platform prepares for a public offering, it says that most of its revenue is from advertising, but touts artificial intelligence as a growth area.
Makes sense. Reddit really is a data gold mine. I shudder to think about the profile you could have built of me from my up/down votes alone, much less my comments and posts. FOMO and participation always generating data really just has me wondering if we'll see a world where "privacy" becomes extinct or culturally different.
Yeah kinda the reason I stopped contributing even with votes. No more account login, just casually browse/lurk. Even if you periodically delete your account all that data is probably saved somewhere, there's no way to be sure.
Yeah, I'm doing it again on lemmy. I'm definitely not immune in any way to the FOMO and lack of data discipline. We all are, in some ways, even just by being here on his platform.
You should be paid for that. There is a valid lawsuit to be had. Reddit's terms and conditions do not absolve them of value theft from the content you posted.
Ehhh, I'm not super persuaded by that argument tbh. I don't think arbitrary data has any intrinsic value on its own the way copyright-able art does. I work in tech and I'm just not sure I want that can of worms opened.
Edit: I guess I should say, it's not something I've dedicated a lot of thought too. I'm open to arguments to the contrary.
Everything is scraped for training data. They argue that this is fair use under the "research" exemption. However, it is not research, the datasets they build are private and used exclusively for commercial product development.
Even if you could consider it as fair use research - which it isn't - the commerciality of it should exclude it from being fair use.
The Reddit IPO appears set to move forward, with the tech platform filing its form S-1 with the Securities and Exchange Commission on Thursday.
The form lists a variety of new details about the site, including its financials, risk factors, and key business lines.
“Our content is particularly important for artificial intelligence (‘AI’) – it is a foundational part of how many of the leading large language models (‘LLMs’) have been trained,” the company writes in the S-1.
We expect our data advantage and intellectual property to continue to be a key element in the training of future LLMs.”
News reports have pegged the partner as Google, which is using Reddit data to train its Gemini LLM.
According to the filing, Huffman’s total compensation in 2023 was $193.2 million, though that was almost all in the form of stock and option awards, which may or may not vest, depending on the company’s performance.
The original article contains 443 words, the summary contains 153 words. Saved 65%. I'm a bot and I'm open source!