> Why this is happening. Two forces are slowing agentic commerce, according to Leigh McKenzie, director of online visibility at Semrush: infrastructure and trust. Real-time catalog normalization across tens of millions of SKUs is a decade-scale problem Google already solved with Merchant Center, and consumers still default to checkout flows they trust — Apple Pay, Google Wallet, and Amazon one-click.
It turns out when you step outside of “hard tech” problems like building GPT6 there are all of these details others have solved already. E-commerce has been optimized to the last decimal point for the last 30 years.
OpenAI is new to it, and if I had to guess, not that interested in getting good at it.
I think they're interested in getting good at it. They just don't want to put in the human time and effort to do so. They expect their many failures and short-comings to be shored up by continuous model training.
But that, of course, means that in the meantime it will suck and nobody will use it.
I think they're operating beyond their current (human) capacity, trying to test out too many things at a time.
But a dreamer in me entertains another idea: perhaps they're just holding back, because they realize that actually succeeding at this will instantly kill (or at least mortally wound) e-commerce as we know it.
(This is a more narrow version of my belief that general AI tools like LLMs fundamentally don't fit as additions to products, but rather subsume products, and this makes them an existential threat to the software industry. Not to software or computing, just to all the software vendors, whose job is to slice off pieces of computational universe, put them in boxes to prevent interoperability, and give each a name so it's a "product" that can be sold or rented).
> But a dreamer in me entertains another idea: perhaps they're just holding back, because they realize that actually succeeding at this will instantly kill (or at least mortally wound) e-commerce as we know it.
Sam Altman doesn’t give a shit about anyone but himself and has time and again shown he has no restraint for trampling over others to further his own goals. Why would e-commerce be where he draws the line?
I don't think there is any line drawn here. I think if they executed well (and by they I mean any one of the three SOTA LLM vendors), they could already mortally wound the entire software industry today.
Whether or not they want, or will want, to do it at some point, is unknown; the reasons to not do it now are obvious:
1) it's more profitable to keep renting intelligence per token to everyone, preserving the status quo and milking it indefinitely (i.e. while the models aren't yet good enough to reliably single-shot complex software products from half-baked prompts, because once they get there, disruption will happen organically)
2) trying to compete with ~every other software product today is not likely to succeed in the end; a serious attempt would still burn down the software industry, but the major players don't have the capacity to handle it all at once, and doing it gradually will give enough time for regulatory agencies to try and stop it; either way, no one wins
By embracing adversarial interoperability - instead of chasing hundreds of integration deals across industries that put LLMs in products, they focused fully on integrating product access into chat, by combination of business deals, apps/MCPs, and engineer/designer support for users, all directed towards the goal of having the LLM become the "superapp" where work is done, gradually replacing product classes in order of how easy it is.
There's lots of easy but drudge work to enable this that needs to be done at the fringes. For example, LLMs today could easily replace most people's smartphone homescreen experience, or travel/commute experience, as the data is there and LLMs have the capability, even prices are acceptable - what's missing is explicit first-party support to wire it up, keep it wired up.
One step up, what's missing is accepting this explicitly as a goal: to replace software, to make existing products (whether whole or in pieces) the tools AI uses to do work for you. All the vendors seem to carefully walk around the idea, but avoid engaging with it directly, because once they do, they'll be competing with everyone instead of milking them.
> This is a more narrow version of my belief that general AI tools like LLMs fundamentally don't fit as additions to products, but rather subsume products
That seems reasonable, its just yet to be seen if LLMs are a form of artificial intelligence in any meaningful sense of the word.
They're impressive ML for sure, but that is in fact different from AI despite how companies building them have tried to merge the terms together.
> but rather subsume products, and this makes them an existential threat to the software industry.
The US stock market has priced this in already. Many software only companies are perceived to be under threat by ai. It represents a wonderful arbitrage opportunity for ai skeptics in fact.
Why do you foresee OpenAI’s involvement in the software business mitigating the resistance to interoperability and companies making money through productization? If they were actually interested in solving those problems instead of trying to secure themselves the biggest slice of economic pie, wouldn’t they have been happy about Chinese companies distilling their models? Are they engagement-juicing inn their heavily subsidized service à la Uber because they’re interested in promoting a better future for humanity? I’m skeptical.
> E-commerce has been optimized to the last decimal point for the last 30 years.
It certainly hasn't been optimized to anything in 1996. In 1996 it was people clumsily scanning print catalogs, spending 5 hours to upload 10 images on dialup and making a simple HTML page (no DB or any kind of backend) and putting their landline phone on it with a message to "call to checkout"
I know you were exaggerating for effect, but E-commerce and catalog normalization are definitely not "solved" everywhere.
McMaster Carr is a good example of a company that has 90%+ of their stuff ironed out, but most websites and especially small ecommerce isn't like that.
I think you're misinterpreting what his comment meant. I read it as meaning that e-commerce has been optimised repeatedly over the 30 years, from a basic start (which as you pointed out was haphazard) to the point where it is now optimised to extract every possible cent from the user, whether by encouraging them to buy with one click (the Amazon one-click patent must be around 20 years old now), time-limited promo spot pricing, sending you e-mails about what you had in the basket if you don't complete a sale, etc...
Right now, by comparison, it sounds like AI based shopping is still in the very early stages. Maybe further along than the early e-commerce, but still with a long way to go in its evolution. That'll probably happen quicker than with e-commerce, because a lot of the knowledge about what does or doesn't work has already been learned, but it sounds like it's still a long way behind. Caveat - I've never used it myself, so I don't know how far it is along that path, I'm just basing that from the article.
Shopping research has been pretty funny to me at least, a straightforward way for them to do product placement that people actually want, but implement it so poorly that half of the links it returns are broken.
Also having to wait for ChatGPT for a "thinking" response to search for information that is slower than a Google search loses them lots of money.
I believe that it can still work and I won't claim about being unsurprised about this failure. But this is a great opportunity to execute this problem really well if OpenAI and others are not interested in getting good at this.
Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.
The only problem is that it must be quicker or just as quick as a Google search, and also compatible with the existing checkout flows.
> Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.
Any details on that? I feel the answer is more likely there than in "friction".
Hardly any purchase of consequence is so sensitive to friction that the difference between Google Search and an LLM response matters (especially that in reality, we're talking 20+ manual searches per one LLM response). I.e. I'm not going to use LLMs advise on some random 0-100$ purchase anyway, and losing #$ on a ##$ purchase due to suboptimal choice is not that big of a deal - but I absolutely am going to consult it (and have it compile tables and verify sources) on a $500+ purchase and for those I can afford spending few more minutes on research (or rather few hours less, compared of doing it the usual way).
Wow the sceptics really came out in force for this one.
I’m currently using Gemini to research components for a remote controlled plane. I have the frame of the plane and now need to buy correctly specced servo motors, an engine, battery, etc etc. It has saved me so much time and educated me tremendously on how the different components interact and the options available.
If I could just press “buy” from within Gemini and pay via Google Pay (or better still, Apple Pay) I’d do it in a heartbeat.
How many people tried for the novalty with no intention of purchasing? It being a thousand times worse conversion wouldn't matter if they are additional sales???
> It’s like corporations are angry that they need to go through us to get our money.
This is why I think the "you're the product" saying is wrong. You're just some annoyance to managers (whether they're trying to use you just for user numbers and ad views or they're trying to get your money), whose product is the company (shares or just outright selling the company).
What’s your example for this? Because my experience in e comm is that targeted advertising is awful (I bought a lawnmower last week, Amazon knows I bought it. I am now getting ads for lawnmowers, suggested products for lawn mowers, rather than lawn care, gardening tools, or anything to do with the lawnmower I’ve already bought), sites are absolutely overrun with ads and suggested placements for the product they want to sell me rather than the one I’ve searched for, and that everyone except Amazon interrupts the checkout flow with multiple up-sells, verifications, 2FA prompts, 3d Secure validations…
Why is this good? I want an impartial consistent system for shopping. If I can find it at a different site for a lower price, I should be able to do so. I should also be able to have it give me non-bot reviews and ask relevant questions about the product.
The same way I think shopping at Amazon is better than a place like Nike due to objectivity and comparison, I think a chat interface has the potential to take this to another level since places like Amazon have degraded considerably in terms of things like fake third party products and fake reviews.
The buyer of this technology is not shoppers, it's retailers. The measurement of quality is "does it make us more money?" not "does it help me make better buying choices."
Retailers do not want you to make better choices. They want you to buy the widget.
A lot of evidence suggests that also shoppers aren't that interested in making the best choice either. They want to make a tolerable choice with as little effort as possible. There is no basically no consumer market for "power shopping" outside of weird niches like pcpartpicker.com etc.
Is there a way to measure users "making the best choice?" You could measure the amount of time spent comparison-shopping, but most people are terrible at that anyway; it's an acquired skill for sure. Besides a willingness to spend time, it seems like an impossible-to-quantify metric even in the abstract.
That's a cynical way to look at it. Most likely the LLM will take a cut of sales and they'd be more or less indifferent who cuts the check. There's a market for this sort of thing. People will go to the best LLM for shopping. If the LLM is a shitty product, people will switch. LLMs are increasingly commoditized.
All you say is true for an aggregator like Amazon. But Amazon is better than Nike.com because as an aggregator they go from 1 to many retailers. LLMs will go from 1 aggregator (Amazon) to many so it will be better. And they don't have to invest a lot in UI/UX as chat is the interface.
I do agree with your conclusion, but the catalog in most online shops is certainly not impartial. Amazon sells the entire first page of search placement, for example.
Catalog is impartial? Then why are ~40% of every search I do on Amazon a sponsored product? There is no pure "catalog" especially with cheap crap coming out every day from no-name Chinese labels.
Am I the only one that think Amazon has gotten pretty awful in the last 5 years?
> (Good) E-commerce has been ruthlessly optimised to get shoppers to products they'll actually buy and then remove all distractions from buying.
The only e-commerce site that fits this standard is that old one for buying (IIRC) nuts and bolts or such, that pops up on HN every other year, and whose name sadly escapes me now. Everyone else is ruthlessly optimizing their experience to fuck shoppers over and get them to products the vendor wants them to buy, not the products the shoppers actually want (or need).
> A chat interface is just fundamentally incompatible with this. The agent makes it too easy to ask questions and comparison shop.
That is precisely the point.
Chats may suck as an interface, but majority of the value and promise of end-user automation (and more than half the point of the term "User Agent" (as in, e.g., a web browser)) is in enabling comparison shopping in spite of the merchants, and more generally, helping people reduce information asymmetry that's intertwined with wealth and power asymmetry.
But it's not something you can generally sell to the vendors, who benefit from that asymmetry relative to their clients (in fact, I was dumbfounded to see so much interest on the sales/vendor side for such ideas, but I blame it on general AI hype).
Adversarial interoperability is the name of the game.
Just made an order from them. It's weirdly comforting to know there's a company that knows I need clevis bolts and is willing to sell them to me for a transparent price.
Not sure you're aware but you initially sound like you disagree with the post you replied to, only to follow up by enthusiastically reiterating that author's words as if in agreement.
You realize what shoppers and vendors each consider to be "good" e-commerce sites are fundamentally opposed concepts?
Maybe? I'm not sure which way the OP is arguing, in particular because of that "(Good)". So perhaps I misread the comment as arguing the opposite of what it is.
> (Good) E-commerce has been ruthlessly optimised to get shoppers to products they'll actually buy and then remove all distractions from buying.
I don't think so. I know for a fact that search terms are a minefield of gotchas and hacks caused by product decisions that reflect ad-hoc negotiations with partners and sellers. It's an unstable equilibrium of partners trying to shift attention to their products in a certain way. I think that calling this fragile equilibrium optimized has no bearing with reality.
> I don't think so. I know for a fact that search terms are a minefield of gotchas and hacks caused by product decisions that reflect ad-hoc negotiations with partners and sellers. It's an unstable equilibrium of partners trying to shift attention to their products in a certain way. I think that calling this fragile equilibrium optimized has no bearing with reality.
You think a crude, unoptimised "minefield" is the route that leads to something as delicate as a "fragile equilibrium?" I don't see something as carefully balanced as your unstable equilibrium even being something that could exist without the processes involved having been refined down to a science. The only real alternative that meets your narrative would be that this is an industry that runs entirely on hope and luck (and enough human sacrifices to keep ample supplies of both on hand).
When I shop for special hardware (e.g. bicycle shift gear) it is usually underspecified.
If the information does not exist in the text block, a chat bot is of no use.
Chat bots don't belong to an e-commerce site; chat bots belong on the outside, specifically to comparison-shop and pull in some external information to de-bullshitify offers, correct "mistakes" and "accidental omissions" in the listings, resolve the borderline-fraudlent crap companies play these days with store-specific and season/promotion-specific SKUs with different parameters all resolving to same model/make name (think Black Friday/Cyber Monday deals that are not actually deals, just inferior hardware with dedicated SKU).
The use case for chat interfaces would be as follows:
Grandma wants to buy a good bike, but doesn't know about types of wheels or how many gears they need, or what type of frame is appropriate for their body type.
Reliable information on this does not exist on vendor sites, though. It exists on Reddit and in books and in med/physio papers and bunch of other places a SOTA model has read in training or can (for now) access via web search.
LLMs are already very good for shopping, but only as long as they sit on the outside.
We are talking about a hypothetical sales chatbot which would be built alongside the business, so they absolutely have the capacity and information necessary to train the chatbot to advise their own clients.
The problem is that the chat transcript is legally binding. If the chatbot makes incorrect statements which the customer relies on for their complex purchase, then you're going to have to refund them.
The idea that AI will suddenly solve e-commerce demonstrates a lack of understanding on everything that has happened in this space over the last 25 years.
There’s a lot of this going on in AI at the moment. New folks come in thinking they have a magic solution and then produce a total train wreck as it turns out domain expertise is still a thing.
What would it even mean for an ai to solve e-commerce? Is that a claim made here? Is e-commerce a problem to be solved? It seems like e-commerce does just fine, if plagued by poor quality product, fake reviews, and relentless borderline fraudulent marketing.
Was anyone suggesting AI would help with it? It seems from the article that Walmart (presumably experts in e-commerce) themselves willingly collaborated with open ai. Especially at Walmart's level, what even was the theory?
In any case. It seems that despite this poor result, Walmart decided to essentially go ahead anyway and partner with open ai to put their "own chatbot" inside the open ai app?
Well everyone is desperate to show that they’ve built and deployed some AI thing. Few have done so and demonstrated meaningful value that justifies all the expense of doing it.
You can either have AI be honest or AI become a marketing tool. The two are fundamentally incompatible.
You won't get it to push your products when users ask what's the best XYZ - either because it'll be too honest to lie or because it'll be too expensive for you.
I’ve been running e-commerce systems for 30 years (tech, marketing, etc). This was going to fail from the start for one reason: intent.
Most people using AI chat are exploring ideas and solutions. They’re doodling, not shopping. Or in old timey parlance, they’re looky-loos or tire kickers at best.
Anyone who’s had to justify ad spend in e-commerce can tell you that some sources produce huge traffic with absolutely terrible conversion. Reddit and Pinterest pretty much blow for this reason, with limited exceptions. It’s also why TikTok and other influencer platforms really work.
Conversion requires a mental shift from discovery to demand.
Also, really hate summaries like this without the actual source so here are the main points from the actual source (WIRED https://archive.is/7DuEV):
1. Instant Checkout inside ChatGPT performed poorly, with conversion about one-third of Walmart’s normal site.
2. The experience failed largely because it forced single-item purchases instead of letting users build a cart.
3. Walmart is shifting to embedding its own assistant, Sparky, inside ChatGPT and keeping checkout on its own system.
4. ChatGPT is still valuable because it’s driving significantly more new customer traffic than search.
5. Purchases that did work were mostly practical, problem-solving items like supplements and tools.
6. Fully automated “agentic shopping” is still unlikely in the near term because people want control over purchases.
7. OpenAI is moving away from in-chat checkout and focusing on helping users research while merchants handle transactions.
In short, AI is useful for discovery, but traditional e-commerce flows still outperform it at closing sales.
Would be interesting to know for other retailers though and how much of this is down to what Walmart sells?
I'm confused by the comment that it failed because it forced single item purchases. Most of my "ecommerce" use is researching and buying one item at a time.
Last year they couldn't draw fingers on hands properly, this year they can't convert at checkouts, I wonder what they'll be failing to do a year from now.
It's probably stuff like this along with investor pressure that will make AI companies slowly make their AIs more profit maximizing (and the long term reason ilya etc was so against even going down that path)
Sounds like they are an intermediary between the user and the business. So it’s not a top-up thing. That would be needlessly complex from ChatGPT’s end, too.
Your argument is "they're designed to influence us" right?
Amazon reviews are paid influence. Reddit posts are paid influence. Everything everywhere you read online is paid influence. I'd rank LLMs between "people I personally trust" and "random people online."
For me, for now, they are. And being "many random people" and not "random person", they average out into something much more trustworthy than even recommendations from most individuals I know personally.
Operative word is "for now" - LLMs caught entrepreneurs unprepared, but they'll catch up and poison this too, same thing that happened with search giving rise to SEO.
The experience is a lot like when you are talking with a friend, then they decide to ask siri or google a question using voice. The result is always imprecise. Meaning they either have to repeat their query, or end up typing it anyway.
If you want to buy a Walmart product, the easiest way is to go to Walmart. Why add an imprecise middle man in between?
That doesn't seem terribly surprising, a human can quickly look through a grid of shirts to find one they like. ChatGPT would be guessing what they might want and the human would probably get a bad experience there with some regularity.
Perhaps clickthrough is worse because there are fewer dark patterns involved and people are mostly just browsing and occasionally buying only what they need.
They didn't really seem to specify the "why" of it with any research. And weird that OAI wasn't supporting them to see wha the issue was.
What if they made instant checkout actually instant? You ask ChatGPT to setup a website and it instantly purchases web hosting and sets up the website there. You can't beat a 100% conversion rate by actually checking out instantly. If you didn't like that host you can ask it to find it alternative and ChatGPT would automatically attempt a refund and then purchase from someone else.
I'd become afraid to ask that bot anything, not knowing what it would automatically try to purchase for me. Paying for the bot itself is already a lot: $20/mo to $200/mo or more.
What are we talking about here? ChatGPT as an interface to buy groceries? Or ChatGPT as an interface to build a website. I fail to see how these would be related other than using a specific technology
> Because most people can't read. Wait for agents generating personalized websites/self checkout apps.
> You ask ChatGPT to setup a website and it instantly purchases web hosting and sets up the website
Multiple comments deflecting from the original shopping conversion failure to recommend ... building a whole new website (with hosting for some reason?). W/o bothering to look through commenter history, one has to assume there are a lot of chatbots on this site or else the people using this stuff have been lobotomized.
I'm sure it'll start happening too, and when that fails, the bots will, i don't know, invent a new macarena. We are definitely headed for an irredeemably stupid future.
> Walmart will embed its own chatbot, Sparky, inside ChatGPT. Users will log into Walmart, sync carts across platforms, and complete purchases within Walmart’s system.
Hah, Clippy's cousin Sparky: every once in a while after ChatGPT answers a question it'll say "Looks like you still have stuff in your WalMart cart. Would you like me to complete that checkout for you? Also, WalMart-brand diapers is on offer this week, shall I add that to your cart?"
> Why this is happening. Two forces are slowing agentic commerce, according to Leigh McKenzie, director of online visibility at Semrush: infrastructure and trust. Real-time catalog normalization across tens of millions of SKUs is a decade-scale problem Google already solved with Merchant Center, and consumers still default to checkout flows they trust — Apple Pay, Google Wallet, and Amazon one-click.
It turns out when you step outside of “hard tech” problems like building GPT6 there are all of these details others have solved already. E-commerce has been optimized to the last decimal point for the last 30 years.
OpenAI is new to it, and if I had to guess, not that interested in getting good at it.
I think they're interested in getting good at it. They just don't want to put in the human time and effort to do so. They expect their many failures and short-comings to be shored up by continuous model training.
But that, of course, means that in the meantime it will suck and nobody will use it.
In most circles, that is "not that interested in getting good at it".
But a dreamer in me entertains another idea: perhaps they're just holding back, because they realize that actually succeeding at this will instantly kill (or at least mortally wound) e-commerce as we know it.
(This is a more narrow version of my belief that general AI tools like LLMs fundamentally don't fit as additions to products, but rather subsume products, and this makes them an existential threat to the software industry. Not to software or computing, just to all the software vendors, whose job is to slice off pieces of computational universe, put them in boxes to prevent interoperability, and give each a name so it's a "product" that can be sold or rented).
Sam Altman doesn’t give a shit about anyone but himself and has time and again shown he has no restraint for trampling over others to further his own goals. Why would e-commerce be where he draws the line?
Whether or not they want, or will want, to do it at some point, is unknown; the reasons to not do it now are obvious:
1) it's more profitable to keep renting intelligence per token to everyone, preserving the status quo and milking it indefinitely (i.e. while the models aren't yet good enough to reliably single-shot complex software products from half-baked prompts, because once they get there, disruption will happen organically)
2) trying to compete with ~every other software product today is not likely to succeed in the end; a serious attempt would still burn down the software industry, but the major players don't have the capacity to handle it all at once, and doing it gradually will give enough time for regulatory agencies to try and stop it; either way, no one wins
I find their software to be of subpar quality and resilience anyways.
There's lots of easy but drudge work to enable this that needs to be done at the fringes. For example, LLMs today could easily replace most people's smartphone homescreen experience, or travel/commute experience, as the data is there and LLMs have the capability, even prices are acceptable - what's missing is explicit first-party support to wire it up, keep it wired up.
One step up, what's missing is accepting this explicitly as a goal: to replace software, to make existing products (whether whole or in pieces) the tools AI uses to do work for you. All the vendors seem to carefully walk around the idea, but avoid engaging with it directly, because once they do, they'll be competing with everyone instead of milking them.
That seems reasonable, its just yet to be seen if LLMs are a form of artificial intelligence in any meaningful sense of the word.
They're impressive ML for sure, but that is in fact different from AI despite how companies building them have tried to merge the terms together.
The US stock market has priced this in already. Many software only companies are perceived to be under threat by ai. It represents a wonderful arbitrage opportunity for ai skeptics in fact.
They definitely would if they could. They desperately need money. They already told the whole world they want to replace them, they just can’t.
Considering the money they need, they over promise and under deliver.
It certainly hasn't been optimized to anything in 1996. In 1996 it was people clumsily scanning print catalogs, spending 5 hours to upload 10 images on dialup and making a simple HTML page (no DB or any kind of backend) and putting their landline phone on it with a message to "call to checkout"
I know you were exaggerating for effect, but E-commerce and catalog normalization are definitely not "solved" everywhere.
McMaster Carr is a good example of a company that has 90%+ of their stuff ironed out, but most websites and especially small ecommerce isn't like that.
Right now, by comparison, it sounds like AI based shopping is still in the very early stages. Maybe further along than the early e-commerce, but still with a long way to go in its evolution. That'll probably happen quicker than with e-commerce, because a lot of the knowledge about what does or doesn't work has already been learned, but it sounds like it's still a long way behind. Caveat - I've never used it myself, so I don't know how far it is along that path, I'm just basing that from the article.
Already your favourite e commerce site has all your data. You can switch on the "buy this automatically" feature.
I believe that it can still work and I won't claim about being unsurprised about this failure. But this is a great opportunity to execute this problem really well if OpenAI and others are not interested in getting good at this.
Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.
The only problem is that it must be quicker or just as quick as a Google search, and also compatible with the existing checkout flows.
Any details on that? I feel the answer is more likely there than in "friction".
Hardly any purchase of consequence is so sensitive to friction that the difference between Google Search and an LLM response matters (especially that in reality, we're talking 20+ manual searches per one LLM response). I.e. I'm not going to use LLMs advise on some random 0-100$ purchase anyway, and losing #$ on a ##$ purchase due to suboptimal choice is not that big of a deal - but I absolutely am going to consult it (and have it compile tables and verify sources) on a $500+ purchase and for those I can afford spending few more minutes on research (or rather few hours less, compared of doing it the usual way).
I’m currently using Gemini to research components for a remote controlled plane. I have the frame of the plane and now need to buy correctly specced servo motors, an engine, battery, etc etc. It has saved me so much time and educated me tremendously on how the different components interact and the options available.
If I could just press “buy” from within Gemini and pay via Google Pay (or better still, Apple Pay) I’d do it in a heartbeat.
If ChatGPT can do this today, I need to try it.
Why would anyone have an extra layer of friction too where things could go wrong, where handing over payment details in another chain.
Just let me buy my stuff in peace. Shopping is not the 'killer app' for GenAI.
A chat interface is just fundamentally incompatible with this. The agent makes it too easy to ask questions and comparison shop.
It’s like corporations are angry that they need to go through us to get our money.
This is why I think the "you're the product" saying is wrong. You're just some annoyance to managers (whether they're trying to use you just for user numbers and ad views or they're trying to get your money), whose product is the company (shares or just outright selling the company).
The same way I think shopping at Amazon is better than a place like Nike due to objectivity and comparison, I think a chat interface has the potential to take this to another level since places like Amazon have degraded considerably in terms of things like fake third party products and fake reviews.
Retailers do not want you to make better choices. They want you to buy the widget.
A lot of evidence suggests that also shoppers aren't that interested in making the best choice either. They want to make a tolerable choice with as little effort as possible. There is no basically no consumer market for "power shopping" outside of weird niches like pcpartpicker.com etc.
All you say is true for an aggregator like Amazon. But Amazon is better than Nike.com because as an aggregator they go from 1 to many retailers. LLMs will go from 1 aggregator (Amazon) to many so it will be better. And they don't have to invest a lot in UI/UX as chat is the interface.
> for a lower price
Catalog is impartial, chatbot is ads pretending as advice.
Am I the only one that think Amazon has gotten pretty awful in the last 5 years?
The only e-commerce site that fits this standard is that old one for buying (IIRC) nuts and bolts or such, that pops up on HN every other year, and whose name sadly escapes me now. Everyone else is ruthlessly optimizing their experience to fuck shoppers over and get them to products the vendor wants them to buy, not the products the shoppers actually want (or need).
> A chat interface is just fundamentally incompatible with this. The agent makes it too easy to ask questions and comparison shop.
That is precisely the point.
Chats may suck as an interface, but majority of the value and promise of end-user automation (and more than half the point of the term "User Agent" (as in, e.g., a web browser)) is in enabling comparison shopping in spite of the merchants, and more generally, helping people reduce information asymmetry that's intertwined with wealth and power asymmetry.
But it's not something you can generally sell to the vendors, who benefit from that asymmetry relative to their clients (in fact, I was dumbfounded to see so much interest on the sales/vendor side for such ideas, but I blame it on general AI hype).
Adversarial interoperability is the name of the game.
Sadly Sigma-Aldrich, the hyphenated retailer for chemistry, appears to have been covered in javascript sludge.
You realize what shoppers and vendors each consider to be "good" e-commerce sites are fundamentally opposed concepts?
I don't think so. I know for a fact that search terms are a minefield of gotchas and hacks caused by product decisions that reflect ad-hoc negotiations with partners and sellers. It's an unstable equilibrium of partners trying to shift attention to their products in a certain way. I think that calling this fragile equilibrium optimized has no bearing with reality.
You think a crude, unoptimised "minefield" is the route that leads to something as delicate as a "fragile equilibrium?" I don't see something as carefully balanced as your unstable equilibrium even being something that could exist without the processes involved having been refined down to a science. The only real alternative that meets your narrative would be that this is an industry that runs entirely on hope and luck (and enough human sacrifices to keep ample supplies of both on hand).
If we are talking custom products or complex appliances that need a lot of guidance, then maybe chat interface is appropriate.
Grandma wants to buy a good bike, but doesn't know about types of wheels or how many gears they need, or what type of frame is appropriate for their body type.
LLMs are already very good for shopping, but only as long as they sit on the outside.
https://www.bbc.co.uk/travel/article/20240222-air-canada-cha...
There’s a lot of this going on in AI at the moment. New folks come in thinking they have a magic solution and then produce a total train wreck as it turns out domain expertise is still a thing.
Was anyone suggesting AI would help with it? It seems from the article that Walmart (presumably experts in e-commerce) themselves willingly collaborated with open ai. Especially at Walmart's level, what even was the theory?
In any case. It seems that despite this poor result, Walmart decided to essentially go ahead anyway and partner with open ai to put their "own chatbot" inside the open ai app?
And given the past few decades there is no reason to not try to do that.
This probably means that OpenAI et al are fine-tuning salesman-like LLMs to "fix" that problem.
Can't wait for the future.
You won't get it to push your products when users ask what's the best XYZ - either because it'll be too honest to lie or because it'll be too expensive for you.
Most people using AI chat are exploring ideas and solutions. They’re doodling, not shopping. Or in old timey parlance, they’re looky-loos or tire kickers at best.
Anyone who’s had to justify ad spend in e-commerce can tell you that some sources produce huge traffic with absolutely terrible conversion. Reddit and Pinterest pretty much blow for this reason, with limited exceptions. It’s also why TikTok and other influencer platforms really work.
Conversion requires a mental shift from discovery to demand.
Also, really hate summaries like this without the actual source so here are the main points from the actual source (WIRED https://archive.is/7DuEV):
1. Instant Checkout inside ChatGPT performed poorly, with conversion about one-third of Walmart’s normal site.
2. The experience failed largely because it forced single-item purchases instead of letting users build a cart.
3. Walmart is shifting to embedding its own assistant, Sparky, inside ChatGPT and keeping checkout on its own system.
4. ChatGPT is still valuable because it’s driving significantly more new customer traffic than search.
5. Purchases that did work were mostly practical, problem-solving items like supplements and tools.
6. Fully automated “agentic shopping” is still unlikely in the near term because people want control over purchases.
7. OpenAI is moving away from in-chat checkout and focusing on helping users research while merchants handle transactions.
In short, AI is useful for discovery, but traditional e-commerce flows still outperform it at closing sales.
I'm confused by the comment that it failed because it forced single item purchases. Most of my "ecommerce" use is researching and buying one item at a time.
> AI accounts for 90% of accidents while only accounting for 1% of traffic
https://abit.ee/en/cars/waymo-robotaxi-autonomous-driving-sa...
I sort of trust them to make product recommendations, but at best I will only open a link they suggest and buy the product there.
Does it actually need direct access to your wallet? I haven't tried it yet, but assumed it would work with a separate wallet, fed through by top-ups.
https://chatgpt.com/merchants/
Never, ever.
Amazon reviews are paid influence. Reddit posts are paid influence. Everything everywhere you read online is paid influence. I'd rank LLMs between "people I personally trust" and "random people online."
Operative word is "for now" - LLMs caught entrepreneurs unprepared, but they'll catch up and poison this too, same thing that happened with search giving rise to SEO.
Do you have a better alternative?
Your product has to be a 10x improvement over the incumbent to be competitive.
In AI speak it would be the “extra-bitter” lesson I guess?
You need to add 10x resources to beat a product that’s already solved with mature tech.
If you want to buy a Walmart product, the easiest way is to go to Walmart. Why add an imprecise middle man in between?
Perhaps clickthrough is worse because there are fewer dark patterns involved and people are mostly just browsing and occasionally buying only what they need.
They didn't really seem to specify the "why" of it with any research. And weird that OAI wasn't supporting them to see wha the issue was.
> You ask ChatGPT to setup a website and it instantly purchases web hosting and sets up the website
Multiple comments deflecting from the original shopping conversion failure to recommend ... building a whole new website (with hosting for some reason?). W/o bothering to look through commenter history, one has to assume there are a lot of chatbots on this site or else the people using this stuff have been lobotomized.
I'm sure it'll start happening too, and when that fails, the bots will, i don't know, invent a new macarena. We are definitely headed for an irredeemably stupid future.
The latest AI is trained on the average citizens social media output. Iq 90.
That’s why AI seemed smart. The bar will not be raised again. We’re cooked.
The enshittification is upon us.