Week 31 news

August 2, 2024

Welcome to our weekly news post, a combination of thematic insights from the founders at ExoBrain, and a broader news roundup from our AI platform Exo…

Themes this week

JOEL

This week we look at:

The evolution of AI companions and their potential to combat loneliness.
Google’s Gemma Scope toolkit, offering a peek into the minds of AI models.
How the UK, EU, and China are taking different paths in the AI race.

Silicon soulmates

Following on from the Llama 3.1 launch last week, this week Meta unveiled AI Studio for creating custom personas (and at the same time disabled its celebrity AI chatbots feature). What does this mean for the growing space of human-AI relationships? Interestingly, a new study from Harvard Business School provides evidence that AI companions can effectively reduce loneliness.

Meta’s AI Studio, available to US users, allows anyone to create AI versions of themselves on Instagram or the web. Powered by Llama 3.1 models, AI Studio offers a range of customisation options. Users can tailor their AI’s name, personality, tone, avatar, and tagline. They can also define topics for their AI to avoid and links they want it to share. These AI profiles can engage in direct chat threads and even respond to comments on behalf of the creator’s account.

“AI Studio is an evolution, creating a space for anyone including people, creators and celebrities to create their own AI,” stated Liz Sweeney, Meta spokesperson. This tool aims to compete with startups like Character.AI and Replika, while also providing a new avenue for creators and businesses to engage with their audience.

AI companions are not limited to the digital realm. This week a new hardware product, the Friend pendant, created by Avi Schiffmann, was announced. Unlike productivity-focused wearables, this always-listening device aims to be a constant companion, offering emotional support and conversation. Powered by Anthropic’s Claude 3.5 language model, the $99 pendant can engage in unprompted commentary about the wearer’s surroundings and experiences.

The Harvard study provides some evidence for the effectiveness of AI companions in reducing loneliness. Through a series of studies, including analysis of real-world conversations and app reviews, as well as controlled experiments, the researchers found that AI companions can alleviate loneliness on par with human interactions.

Key findings from the study include:

AI companions successfully alleviate loneliness, with effects comparable to interacting with another person.
The loneliness-reducing effect persists over time, with significant reductions observed over a week-long period.
Users tend to underestimate the positive impact of AI companions on their loneliness levels.
The feeling of being “heard” by the AI companion is a crucial factor in reducing loneliness, even more so than the chatbot’s performance.

Meanwhile, the AI companion market is booming. Engagement rates on such apps surpass those of general AI assistants by a factor of ten. Character.AI continues to grow and gain interest, with reports this week that Musk’s xAI was considering acquiring the startup. For content creators and influencers, AI avatars offer a way to scale their online presence and engage with followers 24/7. However, this also raises some pretty tricky questions about authenticity and the nature of parasocial relationships.

The Harvard study suggests that whilst AIs cannot provide friendship in the same way as other humans, not all the relationships we find valuable are symmetrical. This perspective suggests that AI companions could help combat loneliness and isolation, particularly for those with limited social connections. However, critics like Sherry Turkle from MIT warn that forming relationships with [unreliable] machines could backfire, potentially leading to fewer secure relationships. There are also concerns about privacy and data collection, as users share personal information with these AI systems.

The development of AI companions also has implications for mental health and social services. While the Harvard study shows they may provide support and practice for social skills, they cannot yet replace professional help or a sense of human connection.

Takeaways: As AI companions develop, Meta’s AI Studio offers a way to explore both creating and interacting with these digital avatars. While the Harvard study provides some empirical evidence that AI can help address loneliness, reliability and long-term availability are not yet a given. As this technology evolves, discussions about its societal impact will be needed. How can we harness the benefits of AI companionship in a world of increasing isolation, while preserving and encouraging human connection? This question is likely to challenge us for years to come.

A model mind-reading toolkit

Back in May we wrote about Anthropic’s fascinating work on ‘mechanistic interpretability’, or understanding the representation of ideas or ‘features’ inside Claude 3 Sonnet. This week, Google released a groundbreaking toolkit called Gemma Scope (alongside a very impressive and tiny Gemma 2B model) and has made the exploration of the inner workings of LLMs available to external researchers.

At its core, Gemma Scope is a collection of what are called ‘sparse autoencoders’ that act like high-powered microscopes, allowing us to zoom in on the specific ‘neurons’ firing within the AI as it processes information. This toolkit doesn’t just offer a snapshot; it provides a detailed map of the model’s thought process, from initial input to final output. By helping us understand how models like Gemma ‘think’, Gemma Scope could let us improve model performance by identifying and enhancing key features, detect and mitigate biases more effectively, develop more targeted and efficient training methods, and ultimately create more trustworthy AI systems by providing clearer explanations of their decision-making processes.
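For readers who want a feel for what a sparse autoencoder actually does, here is a minimal sketch in plain Python. It is an illustration under simplifying assumptions, not Gemma Scope's implementation: it uses an ordinary ReLU encoder and an L1 sparsity penalty, the weights are hand-picked toy values, and every function name (sae_forward, sae_loss and so on) is our own invention. The idea is that a model's internal activation vector is expanded into a larger set of mostly-zero ‘features’, which are then used to reconstruct the original vector.

```python
def relu(values):
    """Zero out negative entries so only 'firing' features remain."""
    return [max(0.0, v) for v in values]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def sae_forward(x, w_enc, b_enc, w_dec, b_dec):
    """Encode an activation vector into sparse features, then decode it back."""
    pre_activation = [p + b for p, b in zip(matvec(w_enc, x), b_enc)]
    features = relu(pre_activation)
    reconstruction = [r + b for r, b in zip(matvec(w_dec, features), b_dec)]
    return features, reconstruction


def sae_loss(x, features, reconstruction, l1_coeff=0.01):
    """Reconstruction error plus a penalty that rewards few active features."""
    mse = sum((a - b) ** 2 for a, b in zip(x, reconstruction)) / len(x)
    sparsity = sum(abs(f) for f in features)
    return mse + l1_coeff * sparsity


# Toy example: a 2-dimensional activation expanded into 4 features.
# These weights are hand-picked for illustration, not learned.
x = [1.0, -2.0]
w_enc = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
b_enc = [0.0, 0.0, 0.0, 0.0]
w_dec = [[1.0, 0.0, -1.0, 0.0], [0.0, 1.0, 0.0, -1.0]]
b_dec = [0.0, 0.0]

features, reconstruction = sae_forward(x, w_enc, b_enc, w_dec, b_dec)
loss = sae_loss(x, features, reconstruction)
print(features)        # only two of the four features are active
print(reconstruction)  # matches the original activation
```

In a real toolkit the encoder and decoder weights are learned from millions of model activations, and the number of features is far larger than the activation size; the toy above only shows the shape of the computation.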

Gemma Scope and tools like this for other popular AIs could revolutionise how we evaluate and monitor outputs. Current methods rely on simple tests and on other AIs’ assessments of confidence, a notoriously unreliable process. With Gemma Scope, we could instead analyse the internal patterns that led to a particular output. This could provide a much more accurate measure of the model’s true confidence and the robustness of its reasoning. Imagine an AI-powered medical diagnosis system. Instead of simply trusting the model’s diagnosis, doctors could use Gemma Scope-like tools to get a report on which medical knowledge features were strongly activated internally during the process. This could help distinguish between diagnoses based on solid medical reasoning and those that might be more speculative.
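To make the idea of a ‘feature report’ concrete, here is a small hypothetical sketch in Python. It assumes we already have a list of feature activations for one model output and a dictionary of human-readable labels for some of those features; the function name top_features, the activation values and the labels are all invented for illustration and do not come from Gemma Scope itself.

```python
def top_features(activations, labels, k=3, threshold=0.0):
    """Return the k most strongly activated features as (label, value) pairs.

    Features at or below the threshold are treated as inactive and skipped.
    Unlabelled features fall back to a generic 'feature_<index>' name.
    """
    active = [(i, a) for i, a in enumerate(activations) if a > threshold]
    ranked = sorted(active, key=lambda pair: pair[1], reverse=True)
    return [(labels.get(i, f"feature_{i}"), a) for i, a in ranked[:k]]


# Hypothetical activations for a single diagnosis, with made-up labels.
activations = [0.0, 4.2, 0.0, 1.1, 0.3]
labels = {1: "cardiology terms", 3: "dosage units"}

report = top_features(activations, labels, k=2)
print(report)
```

A report like this would not prove the reasoning is sound, but it could give a reviewer a starting point for asking whether the strongly activated features are the ones they would expect.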

However, as with any powerful tool, Gemma Scope also raises some questions. How do we ensure that this deeper understanding of AI systems is used responsibly? Could bad actors use these insights to manipulate AI models more effectively? As we peer deeper into AI minds, we must also grapple with the ethical implications of this newfound transparency.

Takeaways: It’s crucial for businesses to stay informed about these interpretability breakthroughs. Organisations should be asking their AI technology and consulting partners how they plan to incorporate tools like Gemma Scope into their evaluation and development processes. This is particularly important in fields where explainability and reliability are paramount, such as healthcare, finance, and legal services. By embracing these new interpretability tools, businesses can not only improve their AI systems but also build greater trust with their customers and stakeholders in a world that is still often struggling to maximise the value from AI.

JOOST

UK stumbles in global AI race

This week, the UK government’s decision to shelve £1.3 billion in AI funding has spotlighted the contrasting approaches to AI strategy across the globe. As the UK grapples with budgetary constraints, the move highlights the urgent need for agility in both government and commercial sectors to keep pace with AI.

The Department for Science, Innovation and Technology’s (DSIT) announcement that it would withdraw funding for key AI projects, including an £800 million exascale supercomputer at Edinburgh University, marks a significant setback for the UK’s AI progress. This decision, driven by what DSIT calls “difficult and necessary spending decisions”, comes at a time when global AI competition is intensifying.

The impact of this decision has not gone unnoticed in the tech industry. Tech business founder Barney Hussey-Yeo warned on social media that reducing investment risked “pushing more entrepreneurs to the US.” This sentiment underscores the potential brain drain and loss of innovation that could result from such funding cuts.

Meanwhile, the EU has enacted its AI Act, focusing on regulation and ethical considerations. This positions the EU as a potential standard-setter for AI governance but raises questions about its ability to foster rapid innovation.

China, on the other hand, is pursuing a strategy focused on efficiency and application. Despite facing challenges such as limited access to advanced US-designed GPUs, Chinese companies are creating smaller, more efficient AI models. Hangzhou-based DeepSeek, for example, released DeepSeek-V2 this year, an open-weight LLM whose coding version Meta used to generate synthetic data for its Llama 3.1 training process.

This pragmatic approach is yielding results in practical applications. As noted in the FT, “China spent 26 years producing its first 10 million EVs and only 17 months to produce the next 10 million. Roughly half of the cars sold in China this year are expected to be tablet-on-wheels smart cars.” This rapid progress demonstrates China’s ability to quickly commercialise and scale new technologies.

The global AI landscape is increasingly characterised by these divergent strategies. While the UK reassesses its approach, the EU’s regulatory focus aims to ensure ethical AI development. China’s emphasis on efficient scale presents a distinctly different path. Will the private sector take up the slack in the UK? Can the EU balance regulation with innovation to remain competitive? Will China be able to keep pace despite starting with a deficit in compute?

Sue Daley, the director of technology and innovation at techUK, emphasises the urgency of the situation: “In an extremely competitive global environment, the government needs to come forward with new proposals quickly. Otherwise, we will lose out against our peers.”

Takeaways: The global AI landscape is in flux, with major players adopting diverse strategies. For businesses and governments alike, agility is key. The UK and EU must act swiftly to avoid falling behind in the AI race, balancing regulation with innovation. Companies should prepare for a varied global AI ecosystem and start thinking now about where they can source the computation that will be vital to their futures, and how to navigate regulatory structures. The race is on, and no one can afford to be left behind.

EXO

Weekly news roundup

This week’s news highlights the continued growth and investment in AI across various sectors, increasing regulatory scrutiny, advancements in AI research, and significant developments in the AI hardware industry.

AI business news

Facebook parent Meta sees strong global ad sales while keeping AI costs in check (Demonstrates Meta’s successful balance between AI investment and profitability, a key concern for many tech companies.)
Meta to 10x training compute infrastructure for Llama 4 (Indicates Meta’s commitment to competing in the large language model space, potentially challenging OpenAI and Google.)
Midjourney drops surprise v6.1 update — now humans look more real than ever (Highlights the rapid progress in AI-generated imagery, raising both excitement and concerns about potential misuse.)
Setting a new bar for medical and financial model performance with Writer LLMs (Shows the increasing specialisation of AI models for specific industries, potentially improving accuracy and utility.)
Canva acquires Leonardo.ai to boost its generative AI efforts (Illustrates the trend of established tech companies acquiring AI startups to enhance their capabilities.)

AI governance news

Striking U.S. video game actors say AI threatens their jobs (Highlights growing concerns about AI’s impact on creative industries and labour markets.)
UK’s AI bill to focus on ChatGPT-style models (Indicates the UK’s approach to AI regulation, focusing on large language models and their potential risks.)
EU AI Act in infancy, but using ‘intelligent’ HR apps a risk (Demonstrates the complexities of implementing AI regulations and their potential impact on business practices.)
Entertainment industry gets behind new bill that will outlaw AI deepfakes (Shows the growing push for legislation to combat AI-generated misinformation and protect individuals’ likenesses.)
OpenAI pledges to give U.S. AI Safety Institute early access to its next model (Indicates a move towards greater transparency and collaboration between AI companies and regulatory bodies.)

AI research news

SAM 2: Segment Anything in Images and Videos (Represents a significant advancement in computer vision, potentially improving various applications from autonomous driving to medical imaging.)
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent (Showcases progress in real-time language translation, which could revolutionise global communication.)
Generative AI in Real-World Workplaces (Provides insights into how AI is being integrated into various industries, offering valuable data for businesses and policymakers.)
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain (Demonstrates the increasing specialisation of large language models for specific industries, potentially transforming legal research and practice.)
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher (Explores new approaches to AI search algorithms, potentially improving information retrieval and decision-making processes.)

AI hardware news

AMD’s Instinct GPUs drove $1B in revenues in Q2 (Highlights the growing demand for AI-specific hardware and increasing competition in the GPU market.)
US launches antitrust probe into Nvidia over sales practices (Indicates increasing regulatory scrutiny of dominant players in the AI hardware market.)
AI chipmaker Cerebras confidentially files for US IPO (Demonstrates the growing investor interest in AI hardware companies and the maturing of the AI chip market.)
Intel announces plan to cut 15,000 jobs to ‘resize and refocus’ business (Reflects the challenges faced by traditional chip manufacturers in adapting to the AI-driven market.)
Samsung Begins Closing Gap With SK Hynix in Making AI Memory Chips for Nvidia (Illustrates the intense competition in the AI chip manufacturing space and the strategic importance of these components.)