The Falcon has been released amongst the pigeons? The Falcon-40B

 


Monopolies like to retain their position. So it makes sense that a trio of monopolistic tech companies would call on the legislators of the world to demand that all LLMs be subject to a license, especially when such licenses may restrict competitors from introducing cheaper, more efficient models into the open-source community.

So let's talk about Falcon-40B. No paper exists for it yet (its Hugging Face page states that one is coming soon). Many things make Falcon of interest. One is that it has been built by TII, the Technology Innovation Institute, which is 'part of Abu Dhabi Government’s Advanced Technology Research Council, which oversees technology research in the emirate. As a disruptor in science, we are setting new standards and serve as a catalyst for change.' So not another US company.

The institute's website is informative:

'Falcon, first unveiled in March 2023, showcased exceptional performance and underscored the UAE's commitment to technological progress. Based on Stanford University’s HELM LLM benchmarking tool, Falcon 40B outperformed its renowned counterparts in utilizing significantly less training compute power. With only 75 percent of the training compute of OpenAI's GPT-3, 40 percent of DeepMind's Chinchilla AI, and 80 percent of the training compute of Google's PaLM-62B, the tool substantiated TII's commitment to advancing developments in generative AI.'

'Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “The open-source release of Falcon 40B, 7.5B, and 1.3B parameter AI models and our high-quality REFINEDWEB dataset, exemplifies the profound scientific contributions of the UAE. With each breakthrough, we defy limitations, reshape the realm of possibilities, and pave the way for collaborative efforts with transformative impact.”'

The licensing of this model is of interest too. It's released via an Apache 2 license and is open for commercial usage up to the value of $1m, after which it is subject to fees. The Falcon has been released amongst the pigeons.
