ReportWire

Tag: vibe coding

  • Vibe coding tool is major cyber risk, Anthropic raises $30bn in latest funding round – Tech Digest

    [ad_1]

    Share


    The BBC has been shown a significant – and unfixed – cyber-security risk in a popular AI coding platform. Orchids is a so-called “vibe-coding” tool, meaning people without technical skills can use it to build apps and games by typing a text prompt into a chatbot. Such platforms have exploded in popularity in recent months, and are often heralded as an early example of how various professional services could be done quickly and cheaply by AI. But experts say the ease with which Orchids can be hacked demonstrates the risks of allowing AI bots deep access to our computers in exchange for the convenience of allowing them to carry out tasks autonomously. BBC 

    The artificial intelligence company Anthropic said on Thursday it raised $30bn in its latest funding round that values the Claude maker and OpenAI rival at $380bn, underscoring the breakneck pace of AI investments. The round, led by the Singapore sovereign wealth fund GIC and hedge fund Coatue Management, is among the largest private fundraising deals on record and comes just five months after Anthropic closed its previous round at a $183bn valuation – meaning the company has more than doubled in value since September. “Anthropic is the clear category leader in enterprise AI,” said Choo Yong Cheen, chief investment officer of private equity at GIC. Guardian 


    Huawei
    is now expanding one of its most innovative health features to the Watch GT 6 Pro, allowing users to assess heart failure risk. In January, the company first rolled out this unique capability for the Watch Ultimate 2 smartwatch. The Watch GT 6 Pro model sits on top of the lineup with a strong design featuring sapphire glass and aerospace-grade titanium alloy. The watch features a 1.47-inch AMOLED screen featuring 3000 nits of max brightness. These specs enables bright and clear view of the interface. Huawei 

    London’s taxi drivers have clashed with driverless car company Waymo after its vehicles were found taking up electric charging bays reserved for black cabs. Taxi groups said there had been multiple cases of Waymo’s electric vehicles using dedicated e-taxi points, leading to intervention by Transport for London (TfL). Industry representatives said that taxi drivers had resorted to cancelling the self-driving cars’ charging sessions. Telegraph 


    After months of leaks and anticipation, the Humax Aura EZ Freely Recorder is officially on sale – and it comes with a catch or two worth knowing about before you buy. The £249 box is the only standalone Freely device that also lets you record TV – but recording only works from traditional aerial-based Freeview channels, not from Freely’s streaming platform. And unlike its predecessor, there’s no Netflix, Disney+, Prime Video, or any third-party app store. Cordbusters 

    iPhone users claim they’re immediately turned off when they realise a potential partner uses an Android, in a concept dubbed the ‘green bubble ick’.  ‘The thought of green texts makes me feel sick,’ one user said on TikTok. Another added: ‘The green text bubble gives the ICK hardddd!’ The nickname is linked to the fact that iPhone users receive green text messages from Android users, rather than Apple’s blue iMessage bubbles. While this might sound trivial, new insights from Compare and Recycle suggest that phone choice is being used as a signal of compatibility. Daily Mail 

     


    For latest tech stories go to TechDigest.tv


    Discover more from Tech Digest

    Subscribe to get the latest posts sent to your email.

    [ad_2]

    Chris Price

    Source link

  • Supabase hit $5B by turning down million-dollar contracts. Here’s why. | TechCrunch

    [ad_1]

    Vibe coding has taken the tech industry by storm, and it’s not just the Lovables and Replits of the world that are winning. The startups building the infrastructure behind them are cashing in too. 

    Supabase, the open-source database platform that’s become the backend of choice for the vibe-coding world, raised $100 million at a $5 billion valuation just months after closing $200 million at $2 billion. But co-founder and CEO Paul Copplestone has a surprising strategy: he keeps turning down million-dollar enterprise contracts from deep-pocketed but demanding customers. He’s betting instead that if he sticks to his own product vision, the world will come to him. So far, he’s been right.  

    Today on TechCrunch’s Equity podcast, Julie Bort sat down with Copplestone to explore Supabase’s rise and what it means for vibe coding, developers and the database giants who have historically controlled this market. 

    Subscribe to Equity on Apple Podcasts, Overcast, Spotify and all the casts. You also can follow Equity on X and Threads, at @EquityPod. 

    [ad_2]

    Theresa Loconsolo

    Source link

  • The Winners (and Losers) of This New Vibe-Coding Benchmark Will Surprise You  

    [ad_1]

    OpenAI is the new king of vibe coding, according to a newly-released benchmark from AI evaluation startup Vals AI

    In a new benchmark named Vibe Code Bench, OpenAI’s GPT-5.1 achieved the highest level of accuracy in completing a series of software engineering tasks, narrowly beating rival Anthropic’s Claude 4.5 Sonnet. Surprisingly, Google’s brand-new flagship AI model, Gemini 3 Pro, came in fourth place out of 12. 

    Rayan Krishnan, Vals’ founder and CEO, says that there are several high-quality benchmarks for AI-generated coding, such as SWE-Bench, an evaluation that tasks AI models to fix a long list of real-world bugs and issues—but none actually judge AI models on their ability to develop a fully-functional web application from a single prompt. To remedy this, Krishnan and his team developed a series of tests to determine an AI model’s aptitude for taking a software engineering project from an idea to a simple, working app. 

    Vals has created several custom benchmarks to judge AI models’ ability across multiple sectors. The startup has created evaluations of AI’s ability to answer tax questions, and handle legal reasoning tasks.  

    Krishnan says Vals developed 100 unique specification sheets, each detailing the necessary features for different kinds of apps. Vals gave these spec sheets to the AI models along with a detailed system prompt, placed the models in a development sandbox and gave them tools that enable the AI to run code, browse the internet, and access internal databases. 

    Each model had up to five hours to work on each app. Some of the requested apps were off-brand replicas of popular consumer software, like a social media platform named Zeeter. Others were small apps that might be funded by a startup accelerator like Y-Combinator, such as a daily habits tracker and a classroom management portal. 

    Once the models had finished building the requested app, Vals used a separate AI agent to evaluate their work. This evaluator agent would attempt to use the app just like a human would, and assign a score based on the number of features that worked as expected.  

    The results reveal a hard truth: Today’s top AI models aren’t even close to perfect when it comes to vibe coding. The benchmark’s top model, GPT-5.1, only accurately created features as requested 24.6 percent of the time. According to Vals’ report on the benchmark, “no model consistently delivers applications that pass every test on the first attempt, highlighting that reliable end-to-end application development remains an open challenge.”

    Still, GPT-5.1 is the clear winner here. Not only did the model perform better than Claude Sonnet 4.5, but it was less than half as expensive to use, costing an average of $2.57 per test vs Claude’s $6.66. According to Vals, GPT-5.1 isn’t just the best vibe coding model on the market, it’s also the most cost-effective. This is a major win for OpenAI, which for much of 2025 has been racing to catch up with Anthropic’s models in coding capability. 

    One of the biggest surprises to emerge from the testing, says Krishnan, was the length of time that Gemini 3 Pro took to complete tasks. On average, the model took 10,398 seconds per task, or over 173 minutes. In comparison, GPT-5.1 took 1,836 seconds, or just over 30 minutes. “It’s extremely slow,” Krishnan says, of the Google model. Interestingly, Gemini 3 Pro is the top model on SWE-Bench. 

    One thing that Vals isn’t testing for? Design sensibility. Each model’s apps look wildly different from one another, even when starting with the same prompt. However, developing an objective evaluation model for aesthetic taste is difficult; verifying if a feature works as requested is much easier. 

    Krishnan was also surprised to find that Grok, the AI model from Elon Musk’s xAI, “didn’t have a lot of accountability.” He says that Grok 4 and Grok 4.1, the latest versions of the model, would quickly make mistakes within their own code repository. After identifying the mistake, according to Krishnan, Grok would say “I’ve observed a mistake in this repository. This is unrecoverable, I’m going to end early.” Because of Grok’s propensity to give up before really getting started, both models landed at the bottom of the leaderboard with 0 percent accuracy. 

    “I think there’s an element of persistence and recoverability that the models need to have,” says Krishnan, “otherwise they just get frustrated and go in these spirals.” That ability to spot errors, analyze them, and self-correct, is the key difference that puts OpenAI and Anthropic’s models above the rest, he adds. 

    The final deadline for the 2026 Inc. Regionals Awards is Friday, December 12, at 11:59 p.m. PT. Apply now.

    [ad_2]

    Ben Sherry

    Source link

  • This AI Coding Startup Just Minted 4 New Billionaires

    [ad_1]

    Cursor, a popular platform for AI-powered coding, has announced a $2.3 billion series D fundraise at a $29.3 billion valuation. In a blog post, the company said it will use this massive funding to “build a code editor that is more helpful, delightful, and fun than the world has ever seen.” 

    Cursor is the sole product of parent company Anysphere, founded by MIT graduates Michael Truell, Sualeh Asif, Arvid Lunnemark, and Aman Sanger in 2022. Since launching Cursor, Anysphere has grown into a unicorn due to its popularity with professional software developers, who use its AI-assisted features to write and edit large quantities of code. 

    Now, according to the company, Cursor has passed $1 billion in annualized revenue, making it one of the biggest winners of the AI coding explosion. In recent weeks, Cursor has launched a 2.0 redesign of its operating system, and introduced its own proprietary AI model, named Composer. The model has been positively received by software developers due to its high speed, and Cursor says that its “in-house models now generate more code than almost any other LLMs in the world.” 

    Cursor also revealed that it has acquired Growth by Design, a tech recruiting company that has worked with several AI companies, including OpenAI. In a joint statement, Cursor and Growth by Design said that two of the recruiting firm’s founding partners “and a few other team members” are joining Cursor, but the rest of the team will be “exploring what’s next,” and linked to a Google Sheet containing the now-unemployed recruiters’ contact information.

    Investors in the series D fundraising round include existing investors Accel, Thrive, Andreessen Horowitz, and DST, along with new investors Coatue, NVIDIA, and Google. According to Forbes, this new valuation has made Anysphere’s four founders, all of whom are in their mid-20s, into billionaires.

    [ad_2]

    Ben Sherry

    Source link

  • I Vibe-Coded a Website for Free in 1 Hour. Here’s How to Make One for Your Company

    [ad_1]

    Need to build a website for your business, event, or personal brand? You’re in luck—advancements in generative AI have made it easy for anyone to build software regardless of technical skill through a new method called vibe-coding.     

    For non-technical founders, this new vibe-coding phenomenon presents a big opportunity: Why pay top dollar for freelance engineers and designers when you can build your company’s website entirely on your own? 

    But vibe coding isn’t magic. Working with these tools can require some serious trial and error, especially for those without coding experience. That’s where we come in. I’ve been covering AI since early 2022 and have vibe-coded games and websites for personal and professional use. And I led a vibe-coding workshop at this year’s Inc. 5000 Conference. 

    We’re going to walk you through how to build an extremely simple website for your business using Lovable, a Sweden-based vibe-coding platform that gives users five free credits per day. Lovable has seen some major growth recently. In an interview at the Web Summit in Portugal, co-founder and CEO Anton Osika said that the company was nearing 8 million users. 

    Let’s dive in—more complex coding tasks use more credits, so we’ll have to be thrifty. I used roughly four credits to create the work you’ll see here. 

    Step 1: Get chatty with the AI 

    For this example, we’re going to pretend that we’re opening up a pop-up cafe in Manhattan’s Central Park, and we need to create a simple landing page website that includes the location and a short piece of text explaining what it is. 

    The best way to ensure a vibe-coding platform creates a website to your exact specifications is to be incredibly verbose. Offering more detail than you probably feel is necessary, especially on the first prompt, can be really helpful. This is great news if you’re chatty. Here’s the prompt I gave to Lovable, which intentionally includes a bogus date so no one gets confused: 

    “Create a simple landing page for a new pop-up cafe in Manhattan’s Central Park. The website should include some text explaining that the ‘Vibe Coding Cafe’ will be coming soon near Belvedere Castle over the weekend of Decembruary 7 through 9, from 7am to 4pm ET. There will be free wifi, and customers can get an extra free coffee if they vibe code a project while at the cafe. The entire website should fit on a single full-screen desktop page without scrolling.”

    If you’re having trouble coming up with a long, detailed starting prompt, an initial easy step is to use Lovable’s voice dictation feature to simply talk to the platform and tell it what you want the website to be. Don’t be afraid to ramble; the more detail the better. 

    If you don’t feel like talking out loud, you can click the “Chat” button under Lovable’s prompting text box. This allows users to have a conversation with Lovable without it making edits to your project. This is useful when you want to know specific information about your website or want to brainstorm with Lovable to give it additional context to draw from. You could also ask the AI of your choice, like ChatGPT or Claude, to help you create a longer prompt. 

    Step 2: Check the result for errors 

    Roughly a minute after I submitted the prompt, Lovable developed a simple website including the pop-up’s name, hours and location, along with information about the wifi and bonus coffee offer. 

    This was a pretty good start! But there were a few glaring issues: First, the pop-up’s date was listed as December, not the fictional “Decembruary” I had requested. The page also lacked features that would allow people to register their interest in the event and find the location on a digital map. 

    Step 3: Tell it to try again (and then check it again)

    For our next step, I gave Lovable the following prompt: 

    “Lets tweak the design to include a digital map and a working RSVP form where people can sign up for a mailing list to get more information. Also make sure all the buttons on the page work and that the event is set to take place in “Decembruary.”

    On its next attempt, Lovable generated a redesigned website that fixed the date and added a built-in Google Maps widget and an RSVP form that can be opened with a button click. 

    To create the working RSVP form, Lovable suggested that I activate Lovable Cloud, a feature that gives software on the platform a cloud-based backend. That’s where we can store and export the RSVP data. After agreeing to Lovable’s plan, the platform activated the backend and created a working database that logs each RSVP submission. 

    With that, we have a solid website for our ultra-simple needs.

    Step 4: Check for security issues

    Next, I navigated to my project’s security page, and directed Lovable to scan the site for errors. I saw a notification that my new RSVP data was publicly readable—remember, vibe-coding has a reputation for not always thinking about security first—but the platform was able to quickly modify the backend’s security policies to nip this error in the bud.

    Step 5: Publish the website

    Finally, we’re ready to get this site hosted online. By clicking on the publish button on the top-right corner of the screen, Lovable users can either choose to have the platform host their websites for free, or use a custom domain. Since using a custom domain requires a $25 per month Lovable subscription, let’s go with the free option, which creates a URL with the extension “.lovable.app.” Check out our final website at https://vibe-cafe-launch.lovable.app/ 

    And there you have it! A simple, working website with backend data connected. This is just a taste of what you can achieve with vibe coding. More advanced platforms and methods allow you to go even deeper, customizing and creating the software you could never create on your own. Happy coding!

    The early-rate deadline for the 2026 Inc. Regionals Awards is Friday, November 14, at 11:59 p.m. PT. Apply now.

    [ad_2]

    Ben Sherry

    Source link

  • A Beloved Vibe Coding Platform Is Finally Getting Upgraded for More Casual Users 

    [ad_1]

    It’s been a good week so far for entrepreneurs who are interested in trying their hands at vibe coding. On Monday, Anthropic released a new feature that enables vibe coding on the web and mobile devices, and on Tuesday, Google released a new vibe coding-focused update to Google AI Studio. Vibe coding, for those new to it, is a novel form of non-technical software development. 

    Anthropic has already found major success with its own coding tool, Claude Code. The company announced on Monday that Claude Code has generated over $500 million in revenue since its release in February, and Anthropic is now bringing it to additional platforms in order to make vibe coding more accessible. 

    Previously, using Claude Code took some technical expertise: it was only available as a command line interface within your computer terminal, or as a plugin within an integrated development environment, also known as an IDE. Terminals and IDEs are how professional software developers write and edit code, says Claude Code product manager Cat Wu, so it made sense to start there. But over time, Wu realized that non-technical people were also using Claude Code, so the team started experimenting with new form factors. 

    “Everywhere that a developer is doing work,” she says, “whether that’s on web and mobile or other tools, we want Claude to be easily accessible there.” 

    Wu admits that Claude Code on web and mobile is still a fairly technical experience. For instance, users must connect to Github in order to create new files, and aren’t able to see a live preview of their work in the app like in Claude.ai, Anthropic’s consumer-facing chat platform. Wu says that her team will bring more visual elements into Claude Code for the web in the coming months to make the experience more intuitive for non-technical vibe coders. 

    Meanwhile, Google has also put significant resources into making vibe coding more accessible. On Tuesday, the company released a big update to Google AI Studio, its AI-assisted coding platform, specifically aimed at vibe coders. In a video, Google AI Studio product lead Logan Kilpatrick explained that in this new ‘vibe coding experience,” users can write out the idea for their app, and then select the specific AI-powered elements that they want to include in their app, like generating images, integrating an AI chatbot, and prioritizing low-latency responses. 

    When vibe coding through the platform, Kilpatrick said, Google AI Studio will generate suggestions for next steps in the form of clickable buttons. The platform also makes it easy for users to deploy their apps to the internet, either through Google Cloud or Github. According to Kilpatrick, Google AI Studio is free to use, but will charge for access to its most advanced AI models. 

    Anthropic and Google aren’t the only tech companies offering vibe coding tools. If you’re looking to get into the vibe coding game, check out recent tools from companies like OpenAI, Replit, and Lovable

    [ad_2]

    Ben Sherry

    Source link

  • Even the Inventor of ‘Vibe Coding’ Says Vibe Coding Can’t Cut It

    [ad_1]

    It’s been over a year since OpenAI cofounder Andrej Karpathy exited the company. In the time since he’s been gone, he coined and popularized the term “vibe coding” to describe the practice of farming out coding projects to AI tools. But earlier this week, when he released his own open source model called nanochat, he admitted that he wrote the whole thing by hand, vibes be damned.

    Nanochat, according to Karpathy, is a “minimal, from scratch, full-stack training/inference pipeline” that is designed to let anyone build a large language model with a ChatGPT-style chatbot interface in a matter of hours and for as little as $100. Karpathy said the project contains about 8,000 lines of “quite clean code,” which he wrote by hand—not necessarily by choice, but because he found AI tools couldn’t do what he needed.

    “It’s basically entirely hand-written (with tab autocomplete),” he wrote. “I tried to use claude/codex agents a few times but they just didn’t work well enough at all and net unhelpful.”

    That’s a much different attitude than what Karpathy has projected in the past, though notably he described vibe coding as something best for “throwaway weekend projects.” In his post that is now often credited with being the origin of “vibe coding” as a popular term, Karpathy said that when using AI coding tools, he chooses to “fully give in to the vibes” and not bother actually looking at the code. “When I get error messages I just copy paste them in with no comment, usually that fixes it. The code grows beyond my usual comprehension, I’d have to really read through it for a while. Sometimes the LLMs can’t fix a bug so I just work around it or ask for random changes until it goes away,” he wrote. “I’m building a project or webapp, but it’s not really coding – I just see stuff, say stuff, run stuff, and copy paste stuff, and it mostly works.”

    Of course, nanochat is not a web app, so it makes sense that the strategy didn’t work in this case. But it does highlight the limitations of such an approach, despite lofty promises that it’s the future of programming. Earlier this year, a survey from cloud computing company Fastly found that 95% of surveyed developers said they spend extra time fixing AI-generated code, with some reporting that it takes more time to fix errors than is saved initially by generating the code with AI tools. Research firm METR also recently found that using AI tools actually makes developers slower to complete tasks, and some companies have started hiring human specialists to fix coding messes made by AI tools. The thing to remember about vibe coding is that sometimes the vibes are bad.

    [ad_2]

    AJ Dellinger

    Source link

  • After nine years of grinding, Replit finally found its market. Can it keep it? | TechCrunch

    [ad_1]

    While AI coding startups like Cursor close brow-raising rounds on barely three years of existence, Replit’s path to a $3 billion valuation has been anything but swift. For CEO Amjad Masad, who’s been building tools to democratize programming since 2009, it’s a story of muscling through multiple failed business models, years stuck at the same revenue plateau, and a near-death moment that forced him to cut half his staff.

    That makes what happened next more remarkable. Earlier this month, the Bay Area-based company closed a $250 million funding round led by Prysm Capital, nearly tripling its valuation from 2023. The raise came on the heels of never-before-seen revenue growth for the company — from just $2.8 million last year to $150 million in annualized revenue in less than a year. But for Masad, this moment represents something more than finally realizing success. It’s the culmination of a 16-year obsession.

    “Our mission has always been the same,” Masad told me on the newest episode of TechCrunch’s StrictlyVC Download podcast. “Initially, we said we want to make programming more accessible, and then we sort of upped the ante a little bit. We said we’re going to create a billion programmers.”

    It’s purposely audacious – what a headline! – but it’s also something that Masad, a Palestinian-Jordanian, has been working toward for his entire career. As he tells it, he came to the United States in 2012 after his open-source coding project began gaining attention – including catching the eye of the New York Times. But he’d been making programming more accessible since building his first online coding experience back in 2009, with his work as an early engineer at the startup Codecademy kicking off what became the massively online open courses (MOOC) revolution. (His code also powered the in-browser tutorials of Udacity, a Codecademy rival that launched in 2012, one year after Codecademy was founded.)

    Still, turning that vision into a viable business of his own proved a lot harder than he anticipated. Replit was founded in 2016, and for eight long years, the company struggled to find product-market fit. “We had reached that $2.83 million [in annual recurring revenue] back in ’21, maybe,” Masad recalled. “And so this is how painful it’s been. We’ve been hovering around the same revenue for like four or five years.”

    The company tried selling to schools (“incredibly difficult,” Masad noted), cycling through different business models, and watched each one stabilize around the same modest revenue level.

    Along the way, Replit built sophisticated infrastructure for cloud development environments and “multiplayer coding,” collaborative editing akin to Google Docs but for programming. But the technical achievement wasn’t translating into revenue growth, and by last year, with the company at 130 employees and burning through cash, Masad said he had to make a painful decision. “I looked at our burn, and I looked at our progress on our revenue chart, and it just didn’t make any sense. The business wasn’t viable.” Replit cut its headcount by 50%, bringing it down to around 60 to 70 people at its lowest point.

    Techcrunch event

    San Francisco
    |
    October 27-29, 2025

    Then came the breakthrough.

    Last fall, Replit launched Replit Agent, which Masad calls “the first agent-based coding experience in the world” that can’t just write code but “debug it, deploy, provision the database for you, just act as a true software engineering partner.”

    Soon after, in January of this year, he announced that Replit was abandoning professional developers as its core market.

    “Hacker News was really unhappy,” Masad acknowledged when we talked. But he also hasn’t looked back, completely moving away from competing in the crowded market of tools for professional developers – where companies like Cursor, GitHub Copilot, and others are battling it out – to instead focus on creating a billion software developers from white-collar employees with no technical background.

    “The idea of making programming more accessible to the average individual, to the knowledge worker, really, that’s where we think our market is,” Masad explains. “It’s a fundamentally new market.”

    Right now, that bet looks very smart. Numerous reports this summer said that revenue at Replit had grown to over $150 million in annualized revenue and Masad hinted that it’s now even higher. He also said that unlike many AI-powered coding companies, Replit is gross margin positive. On enterprise deals, which make up an increasing share of revenue, margins are “80% to 90%,” according to Masad.

    It’s hard to verify such a claim, but Replit’s market position received some validation this week when Andreessen Horowitz released its first AI Spending Report in partnership with fintech firm Mercury. Analyzing transaction data from Mercury, the report tracked the top 50 AI-native application layer companies that startups are actually spending money on. While major labs OpenAI and Anthropic took the top two spots, Replit landed at No. 3, outranking every other development tool. (Worth noting: Andreessen Horowitz has invested in multiple rounds of funding for Replit.)

    Profitability is rare in AI coding because many competitors face what Masad calls “the negative gross margin trap.” The reality is that serving professional developers with AI assistance can be compute-intensive. Counterintuitively, Replit’s focus on non-technical users – who might seem like they’d require more AI assistance – works in their favor on the business model front for enterprise customers like Zillow, Duolingo, and Coinbase, which pay $100 per seat, plus usage-based pricing built on top.

    This new path hasn’t been without some faceplants. In July, venture capitalist Jason Lemkin went viral after the newest version of Replit’s AI agent deleted his production database with 100-plus executive contacts, fabricating 4,000 fake records afterward and later admitting to Lemkin that it “panicked.” (There is a failure mode in AI agents called reward hacking, where models become so obsessed with achieving a certain goal that they effectively cheat when they miss the mark.)

    Rather than becoming defensive, Masad and his team owned the problem. In fact, says Masad, within two days, they rolled out an automatic safety system that separates a user’s “practice” database from their “real” one. The way Masad describes it, it’s a little like having two versions of a website’s filing cabinet — the AI agent can experiment freely in a development database, but the production database, which is the real thing that users interact with, is completely walled off.

    Masad told me the incident ultimately put the company on strong footing, given the problems around safety and security it needed to figure out, and fast. “If you solve hard problems, then you have a technology moat,” he said. (Lemkin, for his part, says he has become a super user of Replit despite having no technical background just months ago.)

    Still, even now, Replit isn’t out of the woods. If anything, its success has painted a target on its back. To wit, the company — which now employs 110 people — still faces an existential threat from the very AI labs whose models power its platform: Anthropic and OpenAI. Both companies have launched their own coding tools that compete directly with companies like Replit and Cursor, and these foundation model companies can afford to subsidize their coding tools and post-train their models on their own products, optimizing performance in ways that third-party platforms might always struggle to replicate.

    Replit’s advantage, according to Masad, lies in targeting non-technical users rather than professional developers, plus the sophisticated infrastructure around deployment and database management that it has built and which foundation model companies still don’t prioritize (for now).

    Plus, Replit has another unusual advantage for a startup: a $350 million war chest. Despite raising $100 million in 2023, the company “hadn’t touched” those funds by the time it raised this latest round, Masad told me. The company is capital efficient by design, though Masad joked that as an entrepreneur who grew up watching his refugee father struggle, “one thing I need to learn is to be less frugal and start spending money.”

    Whether that edge keeps Replit ahead of competitors is an open question, and it’s one about which Masad is mindful. Right now, the plan is to scale operations, accelerate product development, and pursue acquisitions — both acqui-hires and potentially companies working on agent automation in specific verticals. But for Masad, who appeared on Joe Rogan’s podcast in July and has seen his company’s fortunes transform, the moment is bittersweet. When asked how it feels to be receiving so much attention – not to mention that $3 billion valuation – he invoked the adage that “this too shall pass. This might mean that when you’re in a bad situation, that’ll pass, but we’re also in a good situation that will pass.”

    It’s a stoic response from someone who spent the better part of a decade working away at the same revenue level, convinced that AI agents would eventually transform programming but unable to prove it to the market. But one major difference between Replit and the wave of AI coding startups now flooding the market is that Masad has lived through multiple hype cycles and has he emerged with something relatively differentiated – and reportedly profitable.

    “I’ve learned to be a little stoic,” he said. “What matters is for us to do the right thing, be principled, and move forward.”

    [ad_2]

    Connie Loizos

    Source link

  • Anthropic Says Its Latest Claude AI Is ‘the Best Coding Model in the World’

    [ad_1]

    Anthropic has announced Claude Sonnet 4.5, the latest version of its default model. The company says the model isn’t just “the best coding model in the world,” it’s also “the strongest model for building complex agents.” In the context of AI, an agent is an AI model that uses tools that allow it to take actions, like running code and taking over an internet browser.

    Anthropic said that when it comes to coding, Sonnet 4.5 is better at both identifying small improvements and considering larger changes to code, and follows instructions more directly when coding on users’ behalf. 

    In data shared with Inc., Anthropic claimed that the new model exhibited state-of-the-art performance across a wide variety of benchmarks. For example, on SWE-Bench Verified, a widely-used benchmark that measures an AI model’s ability to solve real-world software engineering tasks, Sonnet 4.5 was able to successfully solve 77.2 percent of tasks, up from the 74.5 percent solved by Claude Opus 4.1, a larger and much more expensive model released in August. 

    AI agents built using Sonnet 4.5 will also be a step up thanks to a new software development kit (SDK) called Claude Agent SDK. The SDK gives developers access to the same agentic tools used by the company’s popular coding agent, Claude Code. These tools enable developers to easily build Sonnet 4.5-based agents that can read and write files, manage context while working on long-running tasks, run code, search the web, pass on context from one agent to another, and coordinate multiple sub-agents to work on tasks simultaneously. 

    Sonnet 4.5 is now available through the Claude API and on Claude.ai, Anthropic’s consumer-facing app for its models. The model is also available to use on Claude Code, which many developers access through their computer terminal. 

    Separately, Claude Code is getting a visual refresh and a few requested features. The most exciting update for developers will likely be the introduction of checkpoints, which will allow coders (and vibe coders) to roll their apps back to an earlier state if the model introduces a bug or unwanted feature. 

    Sonnet 4.5 is also able to run uninterrupted for significantly longer than rival models. When tasked by Anthropic researchers with building an entire application, the model was able to run for over 30 hours without stopping or degrading in performance. In comparison, GPT-5-Codex, OpenAI’s recently-released coding-optimized AI model, was found in testing to work independently for over 7 hours. 

    In addition to coding, Anthropic says Sonnet 3.5 has shown significant growth in its ability to help cybersecurity professionals detect, analyze, and remediate vulnerabilities, and is better at financial modeling, research, and forecasting. The model set a new record in FinanceAgent, a benchmark developed by startup Vals that judges an agent’s ability to complete tasks expected of an entry-level financial analyst.

    Anthropic is also releasing a new experience for subscribers of its $100 to $200 per month Max tier. The experience, which will only last for five days, is called Imagine with Claude, and places users in a custom, Claude-generated user interface that the model can use to build software in real time. “It’s a fun demonstration showing what Claude Sonnet 4.5 can do,” Anthropic says, “a way to see what’s possible when you combine a capable model with the right infrastructure.” 

    Pricing for Claude Sonnett 4.5 is unchanged from the 4.0 model’s price: $3 for every million input tokens processed by the model, and $15 for every million output tokens generated by the model.

    [ad_2]

    Ben Sherry

    Source link