POST DeepSeek

Everyone in the tech world, and everyone else beyond, cannot have failed to notice the ructions in the American markets because of the Chinese generative AI “DeepSeek”. As I am fascia

 

Steven Sinofsky argues pretty convincingly that DeepSeek was inevitable. He notes that the history of IT is innovation followed by scale up which is then broken by a model that “scales out”. That is when the bigger and faster approach is replaced by a smaller and more numerous approach.  With that in mind, then, it was a matter of time before a scale out solution arrived, a solution that with different architectural approaches that use less capital to train the AI models. He thinks, as do I, the US innovators will respond with their own scale out approaches: DeepSeek does not mean that American has lost an AI “war” with China.

xxx

DeepSeek r1 is not smarter than earlier models, just trained more cheaply
It doesn’t solve hallucinations or problems with reliability.

From: Five things most people don’t seem to understand about DeepSeek.

 

 

Remember that such hallucinations are not simply an occasional error in LLM output, they are feature of the way that LLMs work. You can certainly implement a variety of mitigations, but you are not going to eradicate them, and they have real world consequences. This has been evident from thea rl 

 

CharGPT also couldn’t seem to stop making things up, a phenomenon experts called “hallucinations.” One radio host in Georgia, US, sued OpenAl in the summer of 2023 for def-amation, claiming that ChatGPT had falsely aceused him of embezzling money, Not long after, two lawyers in New York were fined after they submitted a legal brief they’d cribbed from ChatGPT, which included fake case citations. Users were finding that sometimes, when they asked ChatGPT for sources of its information, it would make those up too.

OpenAl refused to disclose what ChatGPT’s hallucination rate was, but some Al researchers as well as regular users put it at roughly 20 percent, meaning that at least for certain users, and in about one in five instances, ChatGPT was fabrieating infor mation. The tool had been designed to be as useful as possible and to err on the side of confidence; the downside to that was it was often spewing hogwash. Not only were more people using a tool that made it easier to skip the process of hard thinking, they were often being fed misinformation that sounded persuasive and even 

If you are interested in understanding more about the underlying theory, there are a number of useful papers on the topic. One is Xu, Jain and Kankanhalli’s “Hallucination is Inevitable: An Innate Limitation of Large Language Models” which shows that you cannot get rid of hallucinations in real world LLMs no matter what you do because they an inherent in the method. This limitation does not mean that there is no use for generative AI in financial services. There is plenty of work going on in Small Language Models (SLMs) right now. These are trained on fewer parameters, with weights and balances that are tailored to individual use cases. They hallucinate less (which should make mitigation more practical) and they are also faster and cheaper.

xxx

DeepSeek’s success points to an unintended outcome of the tech cold war between the US and China. US export controls have severely curtailed the ability of Chinese tech firms to compete on AI in the Western way—that is, infinitely scaling up by buying more chips and training for a longer period of time. As a result, most Chinese companies have focused on downstream applications rather than building their own models. But with its latest release, DeepSeek proves that there’s another way to win: by revamping the foundational structure of AI models and using limited resources more efficiently.

From: How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI | WIRED.

xxx

Enfield: Woman held for taking Life in the UK tests ‘in disguise’ – BBC News

xxx

A 61-year-old woman has been arrested after allegedly dressing up in a series of wigs and disguises to take Life in the UK tests on behalf of others.
Immigration enforcement investigators arrested the woman at an address in Enfield on Monday on suspicion of fraudulently completing the citizenship tests for at least 14 applicants, both male and female.
She is alleged to have completed the tests across multiple test centres in the UK, disguising herself and doctoring ID documents to evade detection.

From: Enfield: Woman held for taking Life in the UK tests ‘in disguise’ – BBC News.

xxx

DOGE Takes Aim At The Penny | Digg

xxx

Elon Musk’s Department of Government Efficiency is targeting one of the federal government’s most notorious examples of waste: the penny. Getting rid of the penny would test whether DOGE can eliminate a piece of government inefficiency that has survived decades of reform attempts.
Key Details

In fiscal year 2023, taxpayers spent more than $179 million producing over 4.5 billion pennies, with each coin costing more than three cents to make.

From: DOGE Takes Aim At The Penny | Digg.

xxx

Crypto is celebrating but Trump’s boosterism could end badly

xxx

The new president wants America to become the crypto capital of the planet, floating a proposal to create an official US bitcoin reserve. Establishment of such a reserve would give bitcoin an official imprimatur. But it makes little sense.

From: Crypto is celebrating but Trump’s boosterism could end badly.

xxx

Crypto is celebrating but Trump’s boosterism could end badly

Eswar Prasad, a professor at Cornell University and a senior fellow at Brookings, drew an interesting parallel between the US government support for cryptocurrencies and the Chinese government’s support for the domestic property sector to drive its economy while touting it as a way for households to build their own wealth. Now that that property bubble is bursting, the burden is falling heavily on the lower-income households who locked up a large share of their savings in property or are now stuck with failed developers.

Crypto is celebrating but Trump’s boosterism could end badly

xxx

The switch in Trump’s views on cryptocurrencies — from sceptic to vocal advocate — does not mask the reality that nothing has changed in the fundamentals of this asset class, including its lack of intrinsic value.

From: Crypto is celebrating but Trump’s boosterism could end badly.

xxx

POST Hello Operator?

OpenAI’s “Operator” is an AI agent that can take control of a web browser to automate tasks such as booking travel accommodations, making restaurant reservations and shopping online. Operator is powered by a Computer-Using Agent (CUA) model  that combines the vision capabilities of the OpenAI’s GPT-4o model with reasoning abilities from their more advanced models. What this means is that an agent doesn’t need an API to access services, it can use buttons, navigate menus and presumably pass the “prove you are not a robot” puzzles just as people do.

(OpenAI reports that it is collaborating with companies such DoorDash, eBay, Instacart, Priceline, StubHub, Uber and others to ensure that the agents respect the terms of service agreements.)

As it stands, Operator requires human supervision for certain categories of task including, as I am sure you would expect, financial transactions, so that consumer currently need to take control to enter payment information, for example. With the advent of CUA, we can now see the practical evolution of full-blown agentic commerce in strategi timeframes. Agents will go online to obtain services, look for an agentic API and then, if no such APIs is found, simply access the web pages as a human customer does.

Spanish PM Calls to End Social Media Anonymity, Force Digital ID: WEF

xxx

Speaking at the World Economic Forum (WEF) Annual Meeting in Davos today, Prime Minister Sanchez went on a 17-minute tirade against social media and big tech companies for being dangerous to democracies while presenting three solutions:

Force social media platforms to link user accounts with the EU digital identity wallet and end anonymity

From: Spanish PM Calls to End Social Media Anonymity, Force Digital ID: WEF.

xxx

Keep some cash at home because of cyber attacks, DNB says – DutchNews.nl

xxx

Dutch banks are poised to recommend their clients always keep cash money hidden at home, because of the risk that payment systems may be disrupted because of mounting geopolitical tensions.

The Dutch central bank (DNB)said on Wednesday that the chance of a cyber attack on important infrastructure is increasing, particularly from Russia. If the digital payment system is disrupted, then people will no longer be able to pay for goods by bank card, or transfer money automatically.

From: Keep some cash at home because of cyber attacks, DNB says – DutchNews.nl.

xxx

17 Years After Their Invention, Cryptocurrencies Have No Use Case | by Aure’s Notes | Notes | Jan, 2025 | Medium

xxx

No physical businesses and very few online businesses are accepting crypto payments, no one is getting paid in crypto, and those who are use it to evade taxes. Investments in cryptos remain extremely low on a global scale with only 562 million people (or 6.8% of the world population) owning some form of cryptocurrencies.
Some see those low numbers as a sign that the industry is still in its early days. The alternative story is that cryptos have no practical use cases (except making creators and early adopters rich), hence, very few adopters.
The market is now established enough that it’s unlikely to change in the future.
In the mid-term, we can expect the value of Bitcoin to keep climbing.
In the long term though, it’s likely that the blockchain will be made redundant by a superior technology that will be able to break it.

From: 17 Years After Their Invention, Cryptocurrencies Have No Use Case | by Aure’s Notes | Notes | Jan, 2025 | Medium.

xxx

Design a site like this with WordPress.com
Get started