American AI hardware and software firm Groq (not to be confused with Elon Musk’s AI venture, Grok) has announced it’s establishing its first data center in Europe as part of its push into the rapidly expanding EU AI market, as reported by CNBC. It’s looking to capture a sizeable portion of the inference market by leveraging its Language Processing Unit (LPU) chips, application-specific integrated circuits (ASICs) that it claims deliver inference fast and efficiently enough to outcompete GPU-driven alternatives.
“We decided about four weeks ago to build a data center in Helsinki, and we’re actually unloading racks into it right now,” Groq CEO Jonathan Ross told CNBC. “We expect to be serving traffic to it by the end of this week. That’s built fast, and it’s a very different proposition than what you see in the rest of the market.”
It’s that speed and efficiency, in both hardware and operations, that Ross believes will give Groq an edge in a market currently dominated by Nvidia. While the established graphics card manufacturer has cornered the market for AI training hardware, Ross argues that Groq is well-positioned to take over the day-to-day running of those models by powering the inference calculations that let them function.
“Inference tends to be a higher volume, but lower margin business,” he explained, suggesting Groq was happy to take that on. He even argued Nvidia’s shareholders would be happy, because it would help propel the industry forward, benefiting Nvidia in the long term.
However, he pulled no punches when positioning Groq against Nvidia for that inference business. Although complimentary about the power and impressive capabilities of Nvidia’s GPUs, Ross suggested that those general-purpose chips weren’t designed with AI in mind. Groq’s LPUs are.
They’re ASICs, designed specifically for AI inference calculations. They use on-chip memory to reduce latency, and they’re built to handle the linear algebra calculations that large language models require.
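For context on what those calculations look like: nearly all of a large language model’s inference work reduces to repeated matrix multiplications. Below is a minimal, illustrative sketch in Python with NumPy (not Groq’s actual software stack; the dimensions and names are hypothetical) of the kind of operation such chips are built to accelerate, and why on-chip memory matters for latency.

```python
# Minimal sketch (plain NumPy, not Groq's stack) of the dense linear
# algebra that dominates LLM inference. All dimensions are hypothetical.
import numpy as np

d_model, d_ff = 512, 2048  # hypothetical model and feed-forward widths

# One token's activation vector and a feed-forward layer's weight matrices.
x = np.random.randn(d_model).astype(np.float32)
W1 = np.random.randn(d_model, d_ff).astype(np.float32)
W2 = np.random.randn(d_ff, d_model).astype(np.float32)

# A single feed-forward block: two matrix multiplies and a nonlinearity.
# Generating each output token repeats work like this across every layer,
# so latency is dominated by how fast weights can be streamed into the
# multiplies. Keeping weights in on-chip memory, as Groq says its LPU
# does, avoids round trips to external DRAM or HBM.
h = np.maximum(x @ W1, 0.0)  # ReLU(x @ W1)
y = h @ W2                   # project back down to the model dimension
print(y.shape)               # -> (512,)
```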
Ross claims this design makes Groq’s LPUs both fast and efficient, using around a third of the power of traditional GPU designs to produce the same output. Ross isn’t just banking on the speed of his company’s hardware, though; he’s also banking on how quickly Groq can respond to demand.
“[Nvidia CEO] Jensen said at GTC, if you want to get GPUs in two years, you need to put your [Purchase Orders] in now […] That’s just a ridiculous requirement,” he said. “For us, it’s about six months. Which is a fourth of the time, and that totally changes your ability to make predictions about what you need.”
“Nvidia can only build as many GPUs as it’s looking to build this year, because it uses very exotic components like HBM […] We don’t use any of that, and so we’re not as supply limited, and that’s really important for inference.”
Backed by major investments from Samsung and Cisco, Groq’s new data center is being built in partnership with American data center firm Equinix, which will help it get up and running faster and provide opportunities for further expansion. Nvidia, meanwhile, has its own long list of AI partners building what CEO Jensen Huang has called “AI factories” all over the world, promising hundreds of billions of dollars of investment.
It feels a little like companies are taking sides, and that may be as important on the software front as it is for hardware. Nvidia has dominated much of the professional graphics card space for some time thanks to its CUDA software stack, with many early AI development toolkits built around its CUDA-X platform, in addition to the sheer size and scale of its overall business.
But even against Nvidia’s gargantuan position in the market, Groq appears confident it can compete. With its LPUs using a much simpler design than Nvidia’s GPUs and leveraging a generic compiler, Groq claims its hardware and software can be leaner, faster, and just as capable, if not more so.
Fierce competition
Ross claimed Groq was ready for smaller, upstart competition too, suggesting that even as hot as the AI inference industry is, Groq should come out ahead of startups beginning to build alternative offerings.
“You’ll get lots of startups popping up, but building an AI chip is expensive. You’re going to [be] spending anywhere from a quarter to half a billion dollars to get that thing to market, and you can’t fund everyone to do that,” Ross continued.
When the interviewer pressed him on staff retention, highlighting the problems OpenAI had recently faced with Meta poaching staff via large signing bonuses, Ross appeared as unconcerned as Sam Altman (despite OpenAI losing researchers to Meta’s efforts).
“I think we’ve had an easier time finding and retaining talent, because we’re a little adjacent to the AI research space […] That said, it is a hot industry, and there is a lot of pull for the best talent. In our case, I think a lot of people view us as having a very high growth trajectory to be successful, and they’d like to be on that path, so people join us very much for the equity and growth potential.”
Regardless of any potential staffing issues, though, Groq is likely to receive a warm welcome in Europe, with many countries in the EU and the UK looking to invest heavily in AI inference data centers in the coming years, as they look to compete with US and Chinese efforts.
If this data center can truly begin serving customers within weeks of the decision to build it, it’s unlikely to be the last. However, the pace and scale of adoption will remain crucial to Groq’s future success. Nvidia made $35.6 billion in data center hardware revenue last quarter alone, while AMD made $3.7 billion. If Groq wants to be a meaningful competitor in the space, it needs to appeal to customers and companies looking for ASIC solutions for AI workloads, which is a very specific subsection of the market.
Groq also needs an answer to how scalable its hardware is, as potential hyperscalers looking for an ASIC solution will be watching the company closely. That’s in tandem with potential competition from other, more established ASIC businesses, such as Broadcom, Marvell, and MediaTek. If Groq proves itself, it might join the same cohort of indirect Nvidia competitors looking for a slice of the extremely hot AI pie.