Do dogs see that and think it falls into the uncanny valley?
(For that matter, my dog seems to think all kinds of things are uncanny-valley dogs, like the wire reindeer people put out for Christmas. They make her crazy.)
In AI development, the dominant paradigm is that the more training data, the better. OpenAI's GPT-2 model had a data set consisting of 40 gigabytes of text. GPT-3, which ChatGPT is based on, was trained on 570 GB of data. OpenAI has not shared how big the data set for its latest model, GPT-4, is.
But that hunger for larger models is now coming back to bite the company. In the past few weeks, several Western data protection authorities have started investigations into how OpenAI collects and processes the data powering ChatGPT. They believe it has scraped people's personal data, such as names or email addresses, and used it without their consent.
Sort of like Clearview AI, the creepy facial recognition company. They probably know each other.
Lost in translation
Following the major reform known as sote, responsibility for healthcare services was shifted at the beginning of this year from 293 municipalities to 21 self-governing wellbeing services counties, plus the city of Helsinki.
The structural reform has led to language issues, reports Hufvudstadsbladet, including new managers who don't speak Swedish, Finland's other official language, in predominantly Swedish-speaking areas, particularly in the southwest.
The western Uusimaa wellbeing service county has struggled to translate its new material into Swedish, relying on machine translations that have resulted in some incomprehensible texts.
"For us it's important that our staff can work in their mother tongue," Sanna Svahn, the county director of western Uusimaa, told the Swedish-language daily.
The wellbeing service county is now seeking translation assistance from the Swedish Cultural Foundation, a private organisation.
(Reuters reprint)
AI companies: "We will scrape programming sites to make it look like our product can program!"
StackOverflow: "Hang on! We want to be paid first."
SO contributors have entered the chat: "Wait a sec! Where's our cut?"
Given the quality and security bugs in a lot of the code I see there, you really, really don't want unsupervised training on SO.
See also: Microsoft sample code.
I know, right? Practically every code example there is slightly wrong. Perfect poison for AI training sets.
Stability's Tuesday filing said the artists "fail to identify a single allegedly infringing output image, let alone one that is substantially similar to any of their copyrighted works." Midjourney's motion said that the lawsuit also does not "identify a single work by any plaintiff" that it "supposedly used as training data."
For anyone's edification, this is how art forgers work. You don't forge any particular artwork by an artist. You create an original artwork in the style and mannerisms of said artist, and then claim it to be a recently discovered piece.
A Cat AI. We are all doomed
ChatGPT creates mostly insecure code, but won't tell you unless you ask
ChatGPT, OpenAI's large language model for chatbots, not only produces mostly insecure code but also fails to alert users to its inadequacies, despite being capable of pointing out its shortcomings.
Amid the frenzy of academic interest in the possibilities and limitations of large language models, four researchers affiliated with Université du Québec, in Canada, have delved into the security of code generated by ChatGPT, the non-intelligent, text-regurgitating bot from OpenAI.
In a pre-press paper titled "How Secure is Code Generated by ChatGPT?" computer scientists Raphaël Khoury, Anderson Avila, Jacob Brunelle, and Baba Mamadou Camara answer the question with research that can be summarized as "not very."
[…]
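To make the failure mode concrete, here is the sort of textbook flaw the paper is talking about, sketched in Python. This is my own illustration, not code from the paper, and the table and column names are invented. The insecure version passes casual testing just fine, which is exactly why a bot that won't volunteer security warnings is a problem.

```python
import sqlite3

# Classic injection bug: building SQL by string interpolation makes the
# caller's "username" part of the SQL itself.
def find_user_insecure(conn: sqlite3.Connection, username: str):
    query = f"SELECT id, email FROM users WHERE name = '{username}'"
    # username = "' OR '1'='1" turns this into "return every row"
    return conn.execute(query).fetchall()

# The boring fix: a parameterized query, so user input stays data.
def find_user_secure(conn: sqlite3.Connection, username: str):
    return conn.execute(
        "SELECT id, email FROM users WHERE name = ?", (username,)
    ).fetchall()
```

Per the paper's finding, the model will hand over something like the first version without comment, yet it can usually spot the injection if you explicitly ask whether the code is secure.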
What does an ex-Pharma Bro do next? If it's Shkreli, it's an AI Dr bot
[…]
On Thursday, Martin Shkreli, released last year from a seven-year stint in prison for securities fraud, announced the availability of Dr Gupta. The controversial entrepreneur described the project in a tweet as "The world's first physician chatbot."
[…]
The Register also asked, "Do you have any concern people may get bad advice from the model and follow that advice, leading to harm?"
That question went unanswered. Dr Gupta does come with a warning that the bot is not providing actual medical advice.
[…]
Interesting read.
Also, this had previously escaped my attention:
Shkreli, shortly after being let out of prison, last year launched Druglike, "a decentralized science (DeSci) drug discovery Web3 platform" that the company's press release [PDF] insisted "is not a pharmaceutical company."
After watching it fall apart in a few Chess games, I'm convinced that once it goes beyond its "opening book" of memorized code examples, it will also fall apart in programming that involves more than boilerplate code generation. Any large project, where it has to maintain context, will be hopeless.
That's the difference between general-purpose AIs like GPT and Bard and the ones specifically trained to play chess. Going back to old systems like Deep Blue: IBM had limited computing power compared to today, so they had to focus, feeding it data from chess professionals accumulated over decades and centuries of play. Models like GPT take in information from everywhere online, including a ton of games from terrible players.
The language-based models are really just very advanced autocomplete at the moment. I noticed that firsthand in the Bard beta: it gave completely wrong information about Destiny 2 weapons and which archetypes they belong to, threw wrong information about platelets into the middle of factually accurate info (platelets do NOT clean plaque from the walls of the heart's arteries), and often mixed fan-fiction material into questions about anime canon.
But what those AIs do well is pick up the general mood around a topic, regardless of whether that mood is factually accurate, which is equal parts awesome and troubling. Because of that, you need a turbo amount of media literacy for any sort of deep topical dive using AIs right now. I wouldn't be shocked to hear that political pollsters are using AI in exactly that way.
No, it's not just that they have samples from bad players. If that were the problem, it would play a bad but legal game, lose, and perhaps concede when its position was hopeless. The problem is that it can't play Chess.
It has such a poor grasp of the game's context that eventually the autocomplete pulls a fuzzy answer from a game close to, but not the same as, the current one, and the move is illegal in the actual position. It'll even rewrite the list of previous moves.
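The failure is easy to catch mechanically. Here's a minimal sketch, assuming the python-chess library as a dependency: replay the model's claimed move list and flag the first move that's illegal in the real position. Per the behavior described above, transcripts of GPT games tend to fail this check once the game leaves well-trodden openings.

```python
import chess  # pip install python-chess (assumed dependency)

def first_illegal_move(moves_san: list[str]) -> str | None:
    """Replay an LLM's claimed moves (standard algebraic notation);
    return the first one illegal in the actual position, or None."""
    board = chess.Board()
    for san in moves_san:
        try:
            board.push_san(san)  # raises ValueError on illegal/ambiguous moves
        except ValueError:
            return san
    return None

# A made-up transcript where the model "remembers" a game that isn't this one:
# after 1.e4 e5 2.Nf3 Nc6 3.Bb5, Black has no bishop that can capture on b5.
print(first_illegal_move(["e4", "e5", "Nf3", "Nc6", "Bb5", "Bxb5"]))  # -> Bxb5
```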
This might be a class of problem that GPT and friends can't handle, no matter how much the dataset is tweaked. (Wait for them to cheat and add a Chess plug-in to hide the problem.)
(Yeah I agree that they might sneak in a chess plugin)
von Neumann was right that chess has no hidden information, but I wonder how many grandmasters would make pungent remarks about there being no bluffing, deception, or trying to skull-fuck your opponent?
The problem with a chess cheat is that the class of problems GPT can't handle with fuzzy answers is much larger than chess. Once there's a better understanding of LLMs' limitations, the hype is going to take a major hit, with yo-yoing stock prices and techbro billions in play.
GPT=NFT 2023.
A lot of money to be lost, billions wasted that could have been used to do something useful. See also the $100 billion and ten years wasted on self-driving cars, just to gamble that you can make some people unemployed and the world just a bit worse.
At least self-driving car research produces useful spin-offs, like driver assists that guard against dangerous lane changes or backing over a kid in the driveway, and even autonomous vehicles in controlled situations.
I doubt there will be any from NFTs.
LLMs might produce useful assists that help rather than replace humans.
Hopefully there will be a Robot Hell for the con men who dress these up with hype and sizzle, and get people killed, either directly like Musk, or statistically like Sam Altman, all to play their money games.