Open-Source – TechHQ

Hugging Face Safetensors vulnerable to supply chain attacks
https://techhq.com/2024/03/hugging-face-safetensors-vulnerable-to-supply-chain-attacks/ | Thu, 07 Mar 2024

• Hugging Face vulnerabilities revealed.
• Supply chain attacks can exploit the Hugging Face Safetensors conversion service.
• That means the whole Hugging Face community could be under threat.

Recent research has found that the new Hugging Face Safetensors conversion service is vulnerable to supply chain attacks, with hackers able to hijack AI models submitted by users. As reported by The Hacker News, cybersecurity researchers at HiddenLayer discovered that it is “possible to send malicious pull requests with attacker-controlled data from the Hugging Face service to any repository on the platform.” The researchers also found that it was possible to “hijack any models that are submitted through the conversion service.”

For those who don’t know, Hugging Face is a collaboration platform that software developers use to host and work together on huge numbers of datasets, pre-trained machine learning models, and applications. Users can build on, deploy, and fine-tune these resources as they choose.

Vulnerabilities in Hugging Face

Safetensors is a format designed by Hugging Face to store tensors with security as a priority. Users can also convert PyTorch models to Safetensors through a pull request if desired. Safetensors stands in contrast to “pickle,” an older format that has been exploited by malicious actors to deploy tools such as Mythic and Cobalt Strike and to run unauthorized code.
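To make the contrast concrete, here is a minimal Python sketch (assuming the torch and safetensors packages are installed; the file names are purely illustrative). Loading a Safetensors file only reads tensor data, whereas loading a pickle-based checkpoint can execute code embedded in the file.

```python
# Minimal sketch contrasting Safetensors with pickle-based checkpoints.
# Assumes torch and safetensors are installed; file names are illustrative.
import torch
from safetensors.torch import save_file, load_file

state_dict = {"weight": torch.zeros(2, 2)}

# Safetensors writes raw tensor data plus a JSON header. Loading it never
# executes code stored in the file.
save_file(state_dict, "model.safetensors")
restored = load_file("model.safetensors")

# torch.save uses pickle under the hood. A malicious pickle can embed callables
# that run at load time, which is why converting pickled PyTorch checkpoints to
# Safetensors is attractive in the first place.
torch.save(state_dict, "model.bin")
restored_unsafe = torch.load("model.bin")  # unpickling untrusted files is risky
```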

The recent revelation of possible vulnerabilities comes as a shock to many of Hugging Face’s 1.2 million registered users. The research showed that malicious pull requests could be made via a hijacked model: because the service is expected to convert that model, harmful actors can pose as the conversion bot and request modifications to any repository on the platform.

It’s also possible for hackers to extract the token associated with SFConvertbot, the bot that generates conversion pull requests, and use it to send a dangerous pull request to any repository on the Hugging Face site. From there, a threat actor could manipulate the model, even implanting neural backdoors.
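For context, the sketch below shows roughly how a conversion bot opens a pull request against a target repository using the huggingface_hub library. It is a hedged outline of the legitimate flow rather than HiddenLayer’s exploit, and the repository ID and file name are illustrative, but it shows why a stolen bot token matters: whoever holds it can open pull requests against any public repository on the Hub.

```python
# Rough sketch of how a conversion bot opens a pull request on the Hub.
# The repository ID and file name are illustrative; the token would normally
# belong to the conversion bot, which is exactly why its theft matters.
from huggingface_hub import HfApi, CommitOperationAdd

api = HfApi(token="hf_xxx")  # placeholder for the bot's access token

api.create_commit(
    repo_id="some-user/some-model",  # any public repository can be targeted
    operations=[
        CommitOperationAdd(
            path_in_repo="model.safetensors",
            path_or_fileobj="model.safetensors",  # the converted (or tampered) weights
        )
    ],
    commit_message="Convert weights to Safetensors",
    create_pr=True,  # opens a pull request rather than pushing directly
)
```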

According to researchers, “an attacker could run any arbitrary code any time someone attempted to convert their model.” Essentially, a model could be hijacked upon conversion without the user even knowing it.

An attack could result in the theft of a user’s Hugging Face token if they try to convert their personal repository. Hackers may also be able to access datasets and internal models, resulting in malicious interference.

The complexities of these vulnerabilities don’t stop there. An adversary could exploit the fact that any user can submit a conversion request for a public repository, potentially modifying or hijacking a widely used model. That poses a substantial risk to the overall supply chain. The researchers summed this up by saying, “the conversion service has proven to be vulnerable and has had the potential to cause a widespread supply chain attack via the Hugging Face official service.”

Attackers could get access to a container that runs the service, and choose to compromise any models that have been converted by it.

Hugging Face – traditionally, bad things happen afterwards…

The implications go beyond individual repositories. The overall trustworthiness and reliability of the Hugging Face service and its community are under threat.

Co-founder and CEO of HiddenLayer, Chris “Tito” Sestito, emphasized the effects this vulnerability could have on a wider scale, saying, “This vulnerability extends beyond any single company hosting a model. The compromise of the conversion service has the potential to rapidly affect the millions of users who rely on these models to kick-start their AI projects, creating a full supply chain issue. Users of the Hugging Face platform place trust not only in the models hosted there but also in the reputable companies behind them, such as Google and Microsoft, making them all the more susceptible to this type of attack.”

LeftoverLocals

HiddenLayer’s disclosure comes just one month after Trail of Bits revealed a vulnerability known as LeftoverLocals (CVE-2023-4969, with a Common Vulnerability Scoring System (CVSS) score of 6.5). This security flaw enables the retrieval of data from general-purpose graphics processing units (GPGPUs) made by Apple, AMD, Qualcomm, and Imagination. The CVSS score of 6.5 indicates a moderate level of severity, with sensitive data put at risk.

The memory leak identified by Trail of Bits stemmed from a failure to isolate process memory on the GPU, meaning a local attacker could read memory left behind by other processes, including the interactive sessions of other users with a large language model (LLM).

The Hugging Face vulnerabilities, as well as those disclosed by Trail of Bits, only emphasize the need for AI technologies to have stricter security protocols in place. Currently, the adoption of AI is growing at such a rate that security measures cannot keep up. HiddenLayer is one company creating solutions for such shortcomings, with its AISec platform offering a range of products designed to protect ML models against malicious code injections and attacks.

Nevertheless, the revelation of issues with Hugging Face’s Safetensors conversion tool is a stark reminder of the challenges facing the AI and machine learning sectors. Supply chain attacks could put at risk the integrity of AI models, as well as the ecosystems that rely on them. Investigations into the vulnerability are continuing, with the machine learning community more vigilant than ever.

Europe becomes data-interoperable – but do its open standards fall short of the mark?
https://techhq.com/2024/02/reuse-share-and-open-standards-for-the-european-unions-it-functions/ | Thu, 08 Feb 2024

  • EU’s Interoperable Europe Act comes into force.
  • Open standards, reusability and data exchange.
  • Latest in range of laws to limit big tech monopolies.

The adoption on February 6th, 2024, of the EU’s Interoperable Europe Act will help ensure that public bodies in the Community use software and systems that can exchange information and technology freely between them. It aims to promote the reuse of data between public bodies, reducing separate silos of what can be effectively the same information, and to actively deploy and use systems that will make sure that happens easily.

A vital part of the framework rests on open systems and software, so that public organizations are not prevented from accessing information held by others behind barriers such as paid-for API access or proprietary (closed) data formats and databases. The de facto choice for public bodies’ IT decision-makers should therefore be a combination of free (as in libre) software and open standards.

The text of the Interoperable Europe Act contains a clear definition of free software licenses – “solutions that do not carry restrictive licensing terms, such as open source solutions.” The Act also states that public administrations should prioritize open-source software “when [it is] equivalent in functionalities, total cost, user-centricity, cybersecurity or other relevant objective criteria.” That, unfortunately, gives publicly-funded EU bodies an opt-out clause from using free and open-source software. According to the FSFE (Free Software Foundation Europe), the ambiguous wording in the Act means “the Commission is going in the opposite direction of the ‘Free Software first’ approach that this legislation needs,” and furthermore, “It […] shows a lack of ambition which could have led [the Interoperable Europe Act] to become a real game-changer.”

Lobbying by the FSFE throughout the progress of the IEA succeeded in mandating the Commission to provide an annual report on interoperability solutions for public services. The Act contains what it calls ‘Innovation and Support Measures,’ which include regulatory sandboxes (trial of new services and methods without some of the usual rules applying) to promote policy experimentation and the eventual scaling-up of interoperability solutions for reuse across the EU.

Open standards and interoperability

Mandatory assessments for interoperability will also become a feature of IT decision-makers’ administrative burden, alongside the existing raft of reporting around data sovereignty, cybersecurity and data usage. To aid choices in technology solutions that use open standards, the Act describes an Interoperable Europe Portal where shared and reusable solutions can be listed and described.

Although the Interoperable Europe Act is, in general, a positive step toward a more open approach to data exchange and to the choice of systems deployed by public bodies, the progress of its implementation will have to be closely monitored. Open standards in technology drive platform agnosticism, and vendors of large technology solutions will want to discourage migration away from closed systems to equivalent open-source software and open standards.

The private sector, in the form of big tech companies, has a significant interest in the EU’s public sector procurement processes: the most often-told tale that epitomizes this involves Microsoft moving its German headquarters to Munich, right around the time city hall was pushing to shift its workforce from Microsoft’s platforms and services to open-standard formats and vendor-neutral software.

That means that the acid test of the effectiveness of the Interoperable Europe Act will come as public sector technology contracts come due for renewal across Europe.

“European Union Expansion Celebration” by rockcohen is licensed under CC BY-SA 2.0.

The EU is often seen as an area of the world in which large technology companies have a more limited scope to practice compared to, for example, the US. The Digital Services Act and the Digital Markets Act, for example, impose new obligations on companies, and potential fines for non-compliance that can amount to 10% of a business’s annual revenue. Under the Digital Markets Act, the European Commission has the authority to break up businesses that are repeat offenders against its legislation. These laws, and the powers they grant the supranational body, are designed to encourage competition and remove monopolistic practices from the European continent.

With IT playing an ever greater and inescapable part in civil life, the EU’s attempts to protect its citizens are inextricably linked with technology, its implementation, and its use. In the same way that it legislates around drinking water quality, air pollution, and safety in the workplace, its concerns are not anti-business but pro-people. Its ethos may not be appreciated everywhere in the world, but since the end of World War II, European countries have acted on the realization that peace and prosperity come through unity and collectivism, not unfettered free markets.

Lina Ceballos, FSFE policy project manager, stated, “We will monitor the implementation to make sure that free software reaches its full potential as an enabler of transparent, reusable and shareable solutions. We will also keep a close watch for opportunities where the free software community can engage and by providing its expertise support public administrations throughout the EU in delivering free software interoperable digital services.”

Here’s why restricting China from RISC-V tech would hurt the US
https://techhq.com/2023/10/why-would-restricting-china-from-risk-v-tech-hurt-the-us/ | Tue, 10 Oct 2023

  • US lawmakers want Biden to impose export restrictions around RISC-V.
  • US firms like Qualcomm and Google have embraced RISC-V.
  • But would such restrictions damage US business prospects too?

The US export controls imposed in late 2022 kept Washington busy for a period, with the Biden administration gathering allies and closing loopholes that China could use to strengthen its technological prowess — especially in semiconductors. Then Huawei released its 5G-enabled Mate 60 smartphone line, despite the mounting sanctions against the company.

That was enough to trigger the US to want to escalate its technological rivalry with China. China’s evidence of resilience immediately stimulated discussions about the efficacy of the US-imposed sanctions. So now, the US is looking to impede China’s development even further – this time, by targeting RISC-V chip technology.

“We are trying to use every single tool at our disposal to deny the Chinese the ability to advance their technology in ways that can hurt us,” Commerce Secretary Gina Raimondo said after Huawei released the 5G-capable Mate 60 Pro during her visit to the country in late August. Raimondo, like other US lawmakers, has been emphasizing the importance of the US maintaining technological superiority and innovation.

WASHINGTON, DC – SEPTEMBER 19: U.S. Commerce Secretary Gina Raimondo testifies before the House Committee on Science, Space, and Technology at the Rayburn House office building on September 19, 2023 in Washington, DC. Raimondo testified on a one year review of the Chips and Science Act. Kevin Dietsch/Getty Images/AFP (Photo by Kevin Dietsch / GETTY IMAGES NORTH AMERICA / Getty Images via AFP)

So, the US is considering restricting American companies from participating in RISC-V, an open-source chip design architecture on which China is doubling down to reduce its dependence on foreign technologies. Two Republican House of Representatives committee chairmen, along with Republican Senator Marco Rubio and Democratic Senator Mark Warner, are urging President Biden’s administration to take action soon, citing national security grounds.

The move came shortly after the Biden administration warned Beijing of its plans to update rules that curb shipments of AI chips and chip-making tools to China as soon as this month. The Commerce Department, which oversees export controls, is working on an update of export restrictions first released last year. 

The update seeks to limit access to more chipmaking tools in line with new Dutch and Japanese rules, other sources said, and to close some loopholes in export restrictions on AI chips.

What is RISC-V, and why is it the center of the US-China tech war?

RISC-V (pronounced as “risk five”) is an open standard instruction set architecture (ISA) for computing that competes with costly proprietary technology from British semiconductor and software design company Arm Holdings. As a global standard, RISC-V is not controlled by any single company or country. Therefore, it has become a new hope for China to reduce its dependence on foreign intellectual property (IP) suppliers amid an escalating tech war with the US.

RISC-V can be a crucial ingredient for anything from a smartphone chip to advanced processors for AI. According to RISC-V International, the global non-profit home of the open standard RISC-V ISA, related specifications, and stakeholder community, the development of RISC-V specifications is based on contributions that have been made available on a non-proprietary basis or cultivated in the open from members evenly distributed in North America, Europe, and Asia. 

“The only difference is that the marketplace can use these standards without proprietary licenses from a controlling company. Competition does not happen at the standards level, but rather at the implementation level,” the organization said, adding that RISC-V has ushered in tremendous potential for companies worldwide to participate in the rapidly growing semiconductor space.

The chief executive of RISC-V International said that possible government restrictions on the open-source technology will slow down the development of new and better chips, holding back the global technology industry.

Experts and analysts have stood firmly by RISC-V, which has grown tremendously in global adoption and influence as an open computing standard. “The entire tech ecosystem benefits from standards being open, whether it’s RISC-V or other popular standards such as Ethernet, HTTPS, JPEG, or USB,” Calista Redmond, chief executive of RISC-V International, said in a blog post.

Unfortunately, US lawmakers, including both Republican and Democratic senators, are urging the Biden administration to take action on RISC-V on national security grounds, according to a Reuters report. The move marked the first time US politicians have considered restricting the tech standard.

China leads RISC-V adoption

Despite its origins in 2010 at the University of California, Berkeley, RISC-V is overseen by a Swiss-based non-profit foundation that coordinates efforts among for-profit companies to develop the technology. RISC-V has found favor in China as a potential means of navigating technology restrictions imposed by the US.

This means having access to open standards allows companies, including those from China, to innovate faster and spend their time creating differentiated products rather than trying to reinvent the wheel. “Just as companies everywhere have adopted Ethernet, HTTPS, JPEG, and USB standards, we’re seeing a similar trend for RISC-V as an open standard. The flexibility, extensibility, and scalability of RISC-V give developers unparalleled design freedom,” Redmond stated.

China’s embrace of RISC-V also aligns with commercial motivations, including cost reduction and diversification away from Arm, the British semiconductor design giant. Under US regulations, Arm faces constraints on selling certain advanced intellectual property to Chinese clients.

Similarly, US electronic design automation firm Synopsys can only offer a limited-function version of its software to Chinese companies like Huawei Technologies, as confirmed by a Synopsys engineer during a RISC-V event in Beijing earlier this year.

According to an article by the South China Morning Post (SCMP), of the 21 premier members of RISC-V International, nearly half are Chinese, including Alibaba Cloud, Huawei, ZTE, and Tencent Holdings.

“Meanwhile, China has set up a domestic RISC-V alliance. Nine Chinese chip companies – including Alibaba Group Holding’s chip unit T-Head and Shanghai-listed VeriSilicon Holdings – agreed in August to join together, with the condition that members not sue each other over patent infringement.”

Citing Edward Wilford, senior principal analyst of IoT hardware at research company Omdia, SCMP reported that Chinese firms represent 60 to 80% of start-ups using RISC-V. In 2022, global shipments of RISC-V architecture chips exceeded 10 billion units, with half of them coming from China, according to RISC-V International.

“You can stop US firms in RISC-V International from working with Chinese companies, but the collaboration built on RISC-V’s open-standard nature is out there. One can also turn to the internet for the instruction set,” said Wilford. However, the US sees it as Beijing exploiting a culture of open collaboration among American companies to advance its semiconductor industry.

“Communist China is developing open-source chip architecture to dodge our sanctions and grow its chip industry,” Rubio said in a statement to Reuters. “If we don’t broaden our export controls to include this threat, China will one day surpass us as the global leader in chip design.” The lawmakers are urging the Commerce Department to “require any American person or company to receive an export license before engaging with China entities on RISC-V technology.”

In China, executives from Huawei have embraced RISC-V as a pillar of the nation’s progress in developing its own chips. The US and its allies have also jumped on the technology, with chip giant Qualcomm working with a group of European automotive firms on RISC-V chips, and Alphabet’s Google saying it will make Android, the world’s most popular mobile operating system, work on RISC-V chips.

What’s next?

The dynamics of open-standard technology are such that if President Biden’s administration were to regulate US companies’ participation in the Swiss-based foundation in the way lawmakers are seeking, it would complicate how American and Chinese companies work together on open technical standards.

Certainly, such measures could obstruct China’s quest for self-reliance in chip manufacturing. But they would also hinder US and European endeavors to create cheaper and more versatile chips.

The question is whether the US can find ways to acknowledge the truth of Rubio’s statement and evolve domestic chip production around it, without shooting itself in the foot over RISC-V.

A tech war for the future of the world?

How is Moodle advancing the AI education debate?
https://techhq.com/2023/10/what-is-moodle-and-why-should-you-care/ | Mon, 09 Oct 2023

  • What is Moodle and how is it engaging with the AI education debate?
  • The themes for the MoodleMoot Global 2023 included the impact of AI on both education and work.
  • Moodle’s 4.3 release will likely fix more than 300 bugs and add a host of new features.

In recent years, technology has enhanced the world of higher education – whether that world wanted it to or not. Given the wide availability of AI tools for both teachers and students though, more needs to be done to help a range of players in this industry navigate the challenges of technologically-enhanced teaching in the modern era.

A new UNESCO survey of over 450 schools and universities around the world found that fewer than 10% have developed institutional policies or formal guidance concerning the use of generative AI applications.

Moodle, however, is pushing the AI conversation forward — especially at this year’s MoodleMoot Global in Barcelona, Spain. Over 700 participants from 56 countries and more than 100 speakers from 24 countries took part in the conference.

Moodlers of the world, unite!

“It’s a place for the community to meet together, learn about what the other people are doing, [and] learn about what Moodle is doing,” event organizer Diego Fabra explained. “They can meet people from the [headquarters and] ask them questions directly. This is something that doesn’t happen in other conferences.”

What is Moodle?

Moodle’s open-source learning management system (LMS) is free to download, modify and share with others. It bills itself as the “ultimate expression of the values that unite our community of developers, system administrators, educators, and learners.”

Moodle is actually the world’s most popular LMS, so it has something to back up such press office language. It is used by any number of schools, universities, non-profits and companies to manage their education and training needs. Portsmouth Hospitals University NHS Trust, responsible for running the Queen Alexandra Hospital in Portsmouth, Hampshire, has turned to Titus Learning — a certified, premium Moodle partner — to develop a custom Moodle Workplace solution.

Core teams at Moodle HQ coordinate with over 1000 developers, and the Moodle LMS also has an ever-growing community of Certified Partners, developers, system administrators, educators and learners who write new features, fix bugs, update documentation, and share resources and ideas to constantly evolve the platform.

Since Moodle is modular, users can search for and download official plugins from the Moodle Plugin Database. Plugins let users extend and customize the LMS’s functions beyond what Moodle itself has envisioned. This flexibility is what makes Moodle collaborative and community-enhanced.

Moodle’s 4.3 release is set to launch on October 9 and will likely contain more than 300 bug fixes, improvements and new features.

Those new features include an “In Course” communication option intended to enable better collaboration. Matrix-based messaging will also make working with other messaging systems like Slack and Teams relatively seamless.

The potential of artificial intelligence in eLearning

MoodleMoot Global 2023 covered a wide range of interests and expertise, including:

  • The use of augmented and virtual reality in education and training
  • How AI is changing education and the workplace
  • Building core competencies with Moodle
  • Addressing inclusivity and equity with Moodle courses
  • Soft skills revolution — strengthening learners’ critical thinking, interpersonal & creative skills
  • Using Moodle to support Science, Technology, Engineering, and Mathematics (STEM) delivery

On the morning of Day Three at MoodleMoot Global 2023, a panel discussion explored the transformative power of AI in education and workplace learning. The session, titled “How artificial intelligence is changing education and the workplace,” was hosted by Brett Dalto, Head of Education Solutions at Moodle HQ.

It featured a host of experts, including Heikki Wilenius from the University of Helsinki, Tim Hunt from The Open University UK, Elizabeth Dalton from IntelliBoard, Rajnish Kumar from Verificient, and Meghan Mencer of Harnessing Your Potential.

Dalto posed three questions to the panel: are our educational institutions equipped or prepared to address the potential of AI? How will regulating AI impact the education industry? And where will AI have the greatest positive impact on education?

What is Moodle? Just your friendly neighborhood open-learning management system.

Moodlin’ along.

Discussing bias in AI, Dalton suggested that we need to broaden our data to be inclusive of all demographics for AI to be unbiased. Conversely, Kumar from Verificient argued that we should consider the intention behind building an AI system rather than focusing on whether AI is biased.

The panelists also raised thought-provoking questions. Hunt, for example, questioned whether we understand the implications of AI well enough to draft effective legislation. The conversation also touched on how current and future generations will adapt to the growing impact of AI in society.

MoodleMoot 2023 covered a lot of ground involved in the AI education debate as it stands in 2023. How many of the issues will be resolved by the time of MoodleMoot 2024? Watch this space.

Meta, Microsoft release new AI language model for commercial use
https://techhq.com/2023/07/why-meta-microsoft-release-new-ai-language-model-for-commercial-use/ | Wed, 19 Jul 2023

• The latest AI model by Meta, LLaMA 2, is available through major cloud providers, including Microsoft.
• Qualcomm is scheduled to make LLaMA 2-based AI implementations available on flagship smartphones and PCs starting in 2024.
• LLaMA models are available at three levels of pre-training.

Meta has intensified the generative AI race by unveiling its latest large language model, LLaMA 2, which will be open-source and free for commercial and research use. The move puts the social media company in a position to go head-to-head with OpenAI’s free-to-use GPT-4, which powers tools like ChatGPT and Microsoft Bing.

Meta’s press release explains the decision to open up LLaMA as a way to give businesses, startups, and researchers access to more AI tools, allowing for experimentation as a community. In short, the tech giant is sticking to its long-held belief that allowing all sorts of programmers to tinker with technology is the best way to improve it.

LLaMA 2 will not be limited to researchers. Meta said it is open-sourcing the AI model for commercial use through partnerships with major cloud providers, including Microsoft Corp. “We believe an open approach is the right one for the development of today’s AI models, especially those in the generative space where the technology is rapidly advancing,” Meta said in a blog post on Tuesday (July 18).

The Facebook parent company believes making its large language model open-source is a safer option. “Opening access to today’s AI models means a generation of developers and researchers can stress test them, identifying and solving problems fast, as a community. By seeing how others use these tools, our teams can learn from them, improve those tools, and fix vulnerabilities,” the company stated.

Separately, Mark Zuckerberg, in a post on his personal Facebook page, said Meta had a long history of open-sourcing its infrastructure and AI work. “From PyTorch, the leading machine learning framework, to models like Segment Anything, ImageBind, and Dino, to basic infrastructure as part of the Open Compute Project. This has helped us build better products by driving progress across the industry,” he claimed.

Mark Zuckerberg’s Facebook post.

The move would also establish Meta alongside other tech giants as a pivotal contributor to the AI arms race. For context, Zuckerberg has said that incorporating AI improvements into all the company’s products and algorithms is a priority, and that Meta is spending record amounts on AI infrastructure. According to Meta, there has been massive demand for LLaMA 1 from researchers — with more than 100,000 requests for access to the large language model.

What’s new with the latest AI model by Meta?

LLaMA 2 is the first project to come out of the company’s generative AI group, a new team assembled in February 2023. According to Zuckerberg, LLaMA 2 has been pre-trained and fine-tuned on models with 7 billion, 13 billion, and 70 billion parameters. “LLaMA 2 was pre-trained on 40% more data than LLaMA 1 and had improvements to its architecture,” he said.

Meta also says LLaMA 2 “outperforms” other LLMs like Falcon and MPT on reasoning, coding, proficiency, and knowledge tests. For the fine-tuned models, Zuckerberg said Meta had collected more than one million human annotations and applied supervised fine-tuning and reinforcement learning from human feedback (RLHF), with leading results on safety and quality.

Meta developed and released the LLaMA 2 family of large language models (LLMs), a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. Source: Meta

Meta also announced that Microsoft would distribute the new version of the AI model through its Azure cloud service and will run it on the Windows operating system.

Meta said in its blog post that Microsoft was its “preferred partner” for the release. In the generative AI race, Microsoft has emerged as the clear leader through its investment and technology partnership with ChatGPT creator OpenAI, which charges for access to its model.

“Starting today, LLaMA 2 is available in the Azure AI model catalog, enabling developers using Microsoft Azure to build with it and leverage their cloud-native tools for content filtering and safety features. It is also optimized to run locally on Windows, giving developers a seamless workflow as they bring generative AI experiences to customers across different platforms,” the tech giant said.

Meta said LLaMA 2 is available through Amazon Web Services (AWS), Hugging Face, and other providers.
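As an indication of how low the barrier to entry is once a model is on the Hugging Face Hub, the sketch below loads a LLaMA 2 chat model with the transformers library. It assumes the transformers, torch, and accelerate packages are installed, that access to Meta’s gated repository has been granted, and that enough GPU memory is available; the model ID and prompt are illustrative.

```python
# Minimal sketch: loading a LLaMA 2 chat model from the Hugging Face Hub.
# Assumes transformers, torch, and accelerate are installed, and that access
# to the gated meta-llama repository has been granted; IDs are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "In one sentence, what does open-sourcing a language model mean?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Generate a short completion from the pre-trained, fine-tuned chat model.
outputs = model.generate(**inputs, max_new_tokens=80)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```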

Qualcomm partners with Meta to run LLaMA 2 on phones

Shortly after Meta unveiled LLaMA 2, Qualcomm announced that it is partnering with the tech giant for the new large language model. “Qualcomm Technologies Inc. and Meta are working to optimize the execution of Meta’s LLaMA 2 large language models directly on-device – without relying on the sole use of cloud services,” Qualcomm said.

For the US chip designer, the ability to run generative AI models like LLaMA 2 on devices such as smartphones, PCs, VR/AR headsets, and vehicles allows developers to save on cloud costs and provide users with private, more reliable, personalized experiences. Qualcomm is scheduled to make LLaMA 2-based AI implementation available on devices powered by Snapdragon from 2024 onwards. 

“We applaud Meta’s approach to open and responsible AI and are committed to driving innovation and reducing barriers-to-entry for developers of any size by bringing generative AI on-device,” said Durga Malladi, senior vice president and general manager of technology, planning, and edge solutions businesses, Qualcomm Technologies, Inc. 

Malladi believes that to scale generative AI into the mainstream effectively, AI will need to run on both the cloud and devices at the edge, such as smartphones, laptops, vehicles, and IoT devices.

Knowledge graph technology: sharpening data visibility for better decisions
https://techhq.com/2023/06/what-is-the-business-value-of-knowledge-graph-technology-using-open-source-generative-ai/ | Tue, 20 Jun 2023

• Knowledge graph technology is a new way of visualizing data across organizations.
• It can help inform and guide stronger business decisions.
• It’s using open-source generative AI to deliver focused data.

Knowledge graph technology is re-writing the way in which objects, people, companies and supply chains can be visualized, examined, and mined for data that can help bring efficiency savings, deal with data reporting requirements and a lot more.

In Part 1 of this article, we sat down with Paul Hopton, CTO at Scoutbee, a leading company offering knowledge graph technology to enterprise clients, to understand how it could be used in supply chains.

But while we had Paul in the chair, we decided to take a deeper dive into knowledge graph technology as it applied to corporate governance and better decision-making.

GenAI delivering knowledge graph technology.

THQ:

Correct us if we’re wrong here, but you use generative AI to deliver your knowledge graph technology, right?

PH:

We do, but it’s not… quite ChatGPT as we know it, Captain.

THQ:

Intriguing. How so?

PH:

Effectively we have our knowledge graph, where we capture all the information which we find, and then we make elements of that graph available to the customer. What’s very important, both in terms of thinking about knowledge graph technology, but also AI in general, is having multi-tenancy support.

Scoutbee provides knowledge graph technology for enterprises.

Scoutbee – using open-source generative AI to boost business value.

ChatGPT is wonderful, but as it stands, it’s not really designed for thinking about enterprise customers. Or at least, the customers we speak to are very nervous about bringing those kinds of things across into their systems.

So we think that actually training our AI specifically on the customer’s data is a big difference. Once you can actually give that insight into what the customer wants to know, what the AI can learn from their customer’s data, you can actually come to much deeper, more business-valuable conclusions, which gives the customer a competitive advantage.

Knowledge graph technology improves data focus.

THQ:

Ah yes. We’ve spoken to other companies doing different things with generative AI and how it can be used to really boost a company’s productivity. The focus on either the area of interest or the company specifically seems to be key to all the standout offerings: training with specific data, rather than trying to narrow a more generalized AI down into that scenario.

Finding valuable data and drawing business conclusions – priceless.

PH:

Exactly. I mean, you can ask ChatGPT “Who was the star of a 1970s TV film?” and it will give you an answer. You can ask it to explain Foucault’s theorem, and it will come up with some kind of answer. Which is great in terms of use by the general public, but not strictly relevant to a lot of enterprises.

We’ve built our models on open-source models. They’re smaller, but they don’t need to know anything about TV stars, or mathematicians, or how to bake an apple pie.

They need to know about suppliers, and products, and certifications. They understand geography. They understand things that are pertinent to the task of improving our customers’ knowledge of their own company and their relationships with others.

That still means we’ve been working with 7 billion data point models, and we’re now moving up to some 14 billion point models, which give us much better, much more interesting results. But we don’t need to have the same kind of scale that ChatGPT or Bard will do, because we’re solving a niche problem.

That specialist knowledge is really valuable. And having all that information in the knowledge graph database which the AI can interrogate feels exciting, and has been clearly shown to add value to our customers’ businesses.

Knowledge graph technology and open-source.

THQ:

Was it that idea of smaller, more focused generative AI models that drew you to open-source? We remember the ripple of terror that went through the big players when it became clear that the open-source community were getting their hands on generative AI models, precisely because they could do more focused, flexible things with significantly less compute and cost.

PH:

It’s a story we’ve seen time and again: things which are supposedly going to change the world, and it’s rarely while they’re monopolized by big companies that it happens. All the innovative stuff is now sitting on open-source systems. Information wants to be free, and it will find a way of becoming free. That’s what the open-source movement has done, and we had to take advantage of that.

We’re comfortable that we can still build a good business model on top of this, because what we essentially do is use the AI to give people better access to the information that has already been gathered in their systems, and which they’ve shared with us.

It’s that kind of building up that makes the difference. Here’s the data we found from the internet, let’s use it in our knowledge graph technology solution. Here’s the data which you’ve provided, which enriches the knowledge graph.

Knowledge graph technology – like genome sequencing for your business.

Now we’re looking at how we integrate other documents and information that organizations have, to build a much richer AI model for this.

One of the things we talk to our customers about a lot at the moment is the importance of starting to build that out now. If we jump two years into the future, companies that haven’t started engaging with AI now are going to be asking hard questions, and answering hard questions from their shareholders.

Knowledge graph technology – norm of the future?

THQ:

Are we confident then that knowledge graph technology is a norm of the future?

PH:

Well… we are, yes. You kind of have to be in it to win it. The people who are working on this now, in two years’ time, will have a very smart, sophisticated AI system, which understands everything that they want to do.

THQ:

That’s the point with generative AI, isn’t it? It was launched with a bang, and it’s had a contradictory life since then, because on the one hand, it’s been adopted by almost everybody and put into almost everything.

And on the other hand, it’s had quite a few big players and big scientists come back and ask hard questions about whether we really want to do this, as fast as we’re doing it.

But with the open-source option, firstly, you’re not building anything that can necessarily escape its limited data paradigm, and, as is always the case with open-source, the more people you have working on different elements, the more problems you solve.

PH:

Exactly. And I think the capability you have of doing something very destructive is limited when you’re working with a comparatively small open-source model.

Legislation will be necessary at the upper end of the scale, but that’s not really where we are, and the point is, it’s not really where our customers need us to be. They need our models to be focused on their companies, their data points, and their supply chains.

THQ:

As you say, jettison the apple pie recipes.

PH:

Right?

Knowledge graph technology – a new way of looking at data.

There are players in the field who’ve seen the advantage of being able to learn incrementally. Knowledge isn’t a finished thing that you can start at the top left and work down to the bottom right. It grows and grows, organically and in different directions.

That’s why companies like LinkedIn have started using knowledge graph technology – a person’s a person, but graphing what that means and understanding that person through their professional life and their interactions with careers, is quite hard to think about.

LinkedIn uses knowledge graph technology already.

Putting them in a table, that’s maybe nice for a coding exercise if you’re learning a new programming language, but that’s not what you’re ever going to build a business with.

THQ:

A person’s a person, no matter how small… but they’re also a data point with several connecting data points.

PH:

Exactly. I think our typical supplier is probably around 150 interconnected data points. Not mapped in columns and rows, but as a bunch of connected nodes.

And the AI helps us find relationships and nodes which we didn’t see before. And each new relationship and each new node is a potential unit of added value for the company that has it.

That’s the ongoing power of knowledge graph technology.
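As a rough editorial illustration of what “a bunch of connected nodes” looks like in practice, here is a minimal sketch using the open-source networkx library. Every supplier, certification, and relationship named below is invented for the example; a production knowledge graph would sit in a dedicated graph database rather than in memory.

```python
# Minimal sketch of a supplier knowledge graph as connected nodes rather than
# table rows. Assumes the networkx package is installed; every entity and
# relationship below is invented for illustration.
import networkx as nx

graph = nx.DiGraph()

# Nodes carry a type attribute so different kinds of entities can coexist.
graph.add_node("Acme Metals", type="supplier")
graph.add_node("ISO 9001", type="certification")
graph.add_node("Aluminium sheet", type="product")
graph.add_node("Rotterdam", type="location")

# Edges capture the relationships that a flat table struggles to express.
graph.add_edge("Acme Metals", "ISO 9001", relation="holds")
graph.add_edge("Acme Metals", "Aluminium sheet", relation="supplies")
graph.add_edge("Acme Metals", "Rotterdam", relation="ships_from")

# Traversing the graph surfaces every connection one hop from a supplier.
for _, target, data in graph.edges("Acme Metals", data=True):
    print(f"Acme Metals --{data['relation']}--> {target}")
```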

 

Generative AI a threat to human survival – CAIS
https://techhq.com/2023/05/generative-ai-a-threat-to-human-survival-cais/ | Tue, 30 May 2023

• New warnings against the generative AI threat from experts.
• Potential to skew the 2024 election with AI deepfakes.
• The human challenge is to report use of deepfakes.

Generative AI is as big a threat to human survival and society as pandemics or nuclear war. That’s according to the Center for AI Safety in a new statement, which practically begs the powers-that-be to take action to reduce what it calls “extinction-level risk” from the new technology.

There have of course been calls for slowdowns, re-thinks, and a re-corking of the bottle that held the generative genie before now – academics and business leaders have warned that we don’t yet know enough about the technology to set it as free as we have done in a wide range of businesses, from which it’s unlikely we’ll be able to unpick it down the line.

Voices of concern.

Ironically enough, the open letter from the Future of Life Institute, which was the first such organization to call for a pause, was probably robbed of some substance by the involvement of Elon Musk in the call.

Despite Musk being involved in the original birth of OpenAI, he’s a divisive figure, and his time as CEO of Twitter has only deepened that quality, meaning there are now many people, even in the tech industry, who will have seen his involvement in the Future of Life letter as cynical and self-serving, and so ignored any validity in the warnings the letter contained.

When the so-called “godfather of AI,” Geoffrey Hinton, subsequently left Google, citing significant concerns over the development of the technology and its potentially human life-ending potential, the world took rather more notice, because he was a figure at the forefront of the research that has got us to where we are.

The disaster movie cliché.

Hinton, it should be noted, is a signatory of the new statement from the Center for AI Safety. As is OpenAI CEO Sam Altman. And John Schulman, co-founder of OpenAI. As are both Kevin Scott, Chief Technology Officer at Microsoft, and Eric Horvitz, the company’s Chief Scientific Officer. And Lila Ibrahim, Chief Operating Officer at Google DeepMind…

We’re not about to turn this into a roll-call of the great, the good, and the extremely clever, but as with the Future of Life Institute, the Center for AI Safety’s statement is signed by emeritus professors, AI specialists, and active researchers from some of the finest academic institutes in the US, and the world.

And if there’s one tired cliché that can be relied upon in every science fiction B-movie out there, it’s that lots of clever scientists warn of the impending disaster at the start – and are ignored, with devastating, popcorn-chewing results for the next 90 minutes.

Pandemics, nuclear war, generative AI.

As if the Future of Life Institute letter, which openly talked about the potential of generative AI to lead us to a kind of personal extinction, wasn’t bald and hysterical-sounding enough, the Center for AI Safety makes precisely zero bones about the scope of the problems it claims generative AI could lead us to.

“Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war.”

That’s the whole of the statement.

The tech and business communities have already identified a solid handful of risks inherent in the wholesale application of generative AI. In essence, it democratizes altogether too many processes and puts them in the hands of well-meaning idiots.

Or indeed, harm-meaning idiots.

Those processes can include developing prompt-based shell scripts and apps in a coding language you have no idea how to write – taking the expertise out of programming and coding.

They can include writing copy which can be persuasive, engaging, and yet objectively, factually wrong.

And they can include creating phishing and malware bots without much in the way of understanding of the technology involved.

Generative AI deepfakes – the death of truth?

There are also significant concerns about the data on which leading – already adopted – generative AI bots have been trained, and the data they collect and can then, on some level, own and use, as the case of Samsung showed in early May 2023.

The company’s error forced a ban on the use of ChatGPT by its staff, for fear of giving away any more proprietary code to the generative AI.

But one of the biggest concerns over generative AI as we head into the 2024 election season is the rise and rise of the technology in terms of creating convincing deepfake images, videos and even audio footage with AI voices.

There is already deepfake footage circulating on the internet of Governor Ron DeSantis, a challenger for the Republican Presidential nomination, merged with an episode of The Office, which seems to discredit the governor – on this occasion at least, unfairly.

Fake narratives.

There are two points to watch in developments like this.

Firstly, former President Trump, who is – depending on the outcome of several lawsuits – running to be President again in 2024, was both a candidate and a president entirely unperturbed by a lack of evidence to support his claims.

Witness the entirely false narrative of a stolen election in 2020, which led to the Capitol insurrection of January 6th, 2021, with threats made to prominent figures on both sides of the partisan divide, including Speaker Nancy Pelosi and Trump’s own Vice-President, Mike Pence. Just this week, judicial sentences were handed down to some insurrectionist leaders that will put them behind bars for 18 years.

Trump’s narrative from even before he won the White House the first time was that both Democrats, news media and “the Deep State” were peddling a narrative of what he called “fake news” – a label earned by anything other than the most fawning praise of his every move.

In a world where generative AI-based deepfakes are a widely available, increasingly cost-effective way of framing a narrative, it will be interesting to see what defence any news media has against the idea that they are using such technology to peddle an anti-Trump – and therefore in the eyes of many voters, an anti-American – narrative.

It will hardly surprise anyone that former President Trump himself has already been sharing the DeSantis deepfake without appropriate information that identifies it as an AI deepfake.

A question of responsibility.

Secondly though, beyond the Trump factor, the increasing availability of generative AI-based deepfakes and voicefakes threatens the very nature of “truth” in any political campaign.

As in coding, for instance, where you need experienced coders to be able to tell what is wrong with generative AI-written code and put it right, and as in copywriting, where disclaimers about copy being written by AI and fact-checked by human beings are popping up more and more, so there is a need, in a world where these technologies are increasingly commonplace, for news organizations – and political organizations and figures – to own their use of AI-generated fakes whenever they do it, so they can be distinguished from objective and fact-based reporting or video.

Unfortunately, if social media has taught us anything, it’s that facts stand up poorly to a need for people to feel right in their own confirmation biases.

So in the world of regularly available generative AI deepfakes, the idea that anyone will know what “objective truth” is becomes increasingly easy to water down – both across the political spectrum and across the world.

The AI election.

Everyone from conspiracy theorists (“Shock newly discovered footage proves we never went to the Moon!”) to political theorists on both sides of the aisle, to China, to Russia, will be able to use the technology to “prove” their version of reality, and all they need to do is not disclose that it’s an AI deepfake in order to make their audiences incensed against any opponent they choose.

When what you see automatically becomes “the truth,” who wags your dog?

With some news organizations already calling 2024 “the AI election,” the big question is whether concepts like democracy and truth can actually survive long into the AI deepfake era.

As is often the case when the potential danger of generative AI is discussed though, the technology itself is not the real threat. It is, to plagiarize the NRA, only the gun in the hand of the user.

The true test of the AI election – and the world as it looks with so much more generative AI underpinning everything we understand to be true (and on which we base our decisions) – is the honesty of intent of the people using the technology.

Will every news organization and every political campaign agree to signal the fakery of its content, every time it uses generative AI?

We are not, in the final analysis, a culture that has traditionally shown itself able to exercise such power responsibly over extended periods in recent years.

What might open-source generative AI mean for proprietary software?
https://techhq.com/2023/05/what-might-open-source-generative-ai-mean-for-proprietary-software/ | Thu, 18 May 2023

Six months ago, when generative AI first exploded onto the tech world’s consciousness like a sentient tab of acid, offering answers to every question in the knowable cosmos, there was one very noticeable thing about it. While the name officially attached to ChatGPT was OpenAI, a research company with a very tight focus, the power behind the newly-invented throne of generative AI was Microsoft, and its piles and piles of shiny, burning research dollars.

The ensuing scramble to join the generative AI gold rush and establish a claim in the sudden “new world” was very much a race of the tech giants. Google practically fell over itself in its hurry to establish its Bard as a viable alternative to ChatGPT.

Microsoft and OpenAI blithely launched GPT-4, which could do more than ChatGPT had just done, more or less beating their own flush just as much as they’d beaten Google. Alibaba, bless it, with a timing for which it’s only possible to feel a profound sense of sympathy, announced its generative AI offering, Tongyi Qianwen, just in time for China to announce a complete crackdown on the technology, barring chatbots properly trained in solidly socialistic principles.

Rise and stumble.

Generative AI, and ChatGPT in particular (capitalizing on its first-to-market exclusivity), had a fairly messianic few months – going from being everybody’s favorite new toy and the herald of a brave new world of possibilities, to having programmers question the wisdom of democratizing the coding process, to leading AI scientists quitting Google over the potential that generative AI could become smarter than humans in a hurry and just possibly kill us all.

Italy put it in time out while it sought assurances about its data practices. Samsung fell foul of a lack of awareness of those data practices, unthinkingly giving ChatGPT some of its proprietary code and subsequently banning all use of the technology. China, as we mentioned, had a spectacularly socialistic hissy fit. And a collection of esteemed academics, industry figures, and ultimately anybody who felt like it, added their names to an open letter asking the industry to pause development of generative AI beyond the capabilities of GPT-4.

Sam Altman of OpenAI, testifying before Congress this week, acknowledged that the potential of generative AI was scary, and confirmed that whatever will eventually become GPT-5 (or ideally, something with a much catchier name) has not begun training yet, and won’t for at least the next six months.

An understood model of the world.

But the salient point is that all of this happened in a world where the model was familiar – multi-billion-dollar companies funding significant advances that they would eventually add to their product rosters and either charge for directly, or monetize in other ways. They were the kings of this advance, and the development, progress, speed and, above all, the price of the advance would be theirs to dictate.

It was Scottish poet Robert Burns who famously said “The best laid plans of mice and multi-billion-dollar tech giants aft gang aglay.” Or so ChatGPT tells us.

And aglay (astray, or wrong) those plans duly went, when a version of Meta’s foundation model, LLaMA (erratic in its capitalization, but immediately more memorable than ChatGPT), was leaked to the open-source community.

The open-source community, in case you’re new to TechTown, is part army, part ant colony, millions if not billions strong, based all around the world, very techno-geeky and essentially composed almost entirely of the kind of people who could get us not only to Mars but out of the solar system before NASA had got its space boots on, so long as somebody said it couldn’t be done.

The open-source community is made up of puzzle people. They see puzzles, restrictions, limitations, rough edges, and inconvenient, why-won’t-you-do-this-like-I-think-you-should issues as Rubik’s Cubes to be solved in the fastest time and the slickest way, for bragging rights, pizza money, and just occasionally the score of a lifetime when some proprietary software house needs their solutions.

But mostly the bragging rights and pizza money.

And now, they have generative AI code to play with.

The certainties of life.

There are very few certainties in life – death, taxes, occasional crushing political disappointment and the fact that you look neither as good nor as bad as you sometimes think you do.

But if there is a single certainty on which the world literally depends in the 21st century, it’s that things get better when the members of the open-source community get their hands on them. Often cheaper, too, but always, always better.

That came to light late last week when a memo was supposedly leaked from an unnamed Google staffer, listing the many reasons why the traditional proprietary software houses could and probably should be losing their collective minds over the fact that the open-source community has generative AI code to play with.

And while we may never be entirely sure a) whether it came from a genuine Google staffer, or b) whether the views expressed in the memo are in any way indicative of Google’s private corporate internal monologue right now, neither of those things will ultimately matter, because the achievements documented in the memo are real, and verified, and have a defined timeline.

Things like LLMs on a phone, and fully functioning generative AI that needs only the power of a handful of Threadrippers to run, rather than the resource-intensive versions of the technology developed and deployed by the tech giants.
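For a sense of what that lighter footprint means in practice, here is a minimal sketch of running a quantized open model on a single machine using the llama-cpp-python bindings – the model file path and prompt are placeholders of our own, not details from the memo:

```python
# A minimal sketch of running a quantized LLM locally with llama-cpp-python.
# The model path below is a placeholder -- point it at any quantized model
# file you have downloaded; nothing here comes from the "leaked memo" itself.
from llama_cpp import Llama

llm = Llama(model_path="models/example-7b-q4_0.gguf", n_ctx=2048)

result = llm(
    "Q: In one sentence, what is open-source software? A:",
    max_tokens=64,
    stop=["Q:"],  # stop generating when the model starts a new question
)
print(result["choices"][0]["text"].strip())
```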

Most particularly of all, there are two ideas in the “leaked memo” that might well revolutionize the way the world interacts with generative AI.

First, that open-source development allows for smaller, more dedicated, more personalized generative AI models than the behemoth, potentially world-conquering creations that the proprietary giants have had to make, in order to justify their spend on the whole project.

That could mean you can get all the generative AI you need, in an easily-trained and personalized way, without paying the prices of the proprietary tech barons. About which, the argument could easily be made, what’s not to love?

And secondly, the haunting notion that’s backed up by undeniable evidence – that open-sourcers are delivering generative AI capabilities that are, right now, almost as good, and fast, and smart, as anything the giants have come up with. That they’re doing it with significantly less expense and compute-demand, significantly more versatility, and that, very soon – possibly before you finish reading this article – the open-source versions will overtake anything the proprietary houses have to offer, or can ever hope to catch up with.

The meaning of the LLaMA leak.

OpenAI may not be working on GPT-5 just yet. We’d be willing to bet that somewhere in a bedroom or a basement, someone is. Only their version will be faster, more usable and more versatile – and crucially, of course, almost insultingly cheaper.

What does all this mean for the proprietary generative AI giants? We suspect they’ll be trying to figure that out themselves. The idea of significant regulation of the technology was probably necessary in any case, but has gained support from some of the big players in relatively recent days. Could that curtail the operations of the open-source community?

Maybe – regulation could arguably impose rules around what can legally be developed, restricting it to players able to make expensive commitments to principles of corporate responsibility, and creating a monopoly of capital investment that would shut open-sourcers out of actively profiting from their work.

There’s always the potential for a giant intake of open-source coders into the ranks of the tech giants, binding the coders and their developments to the advancement of the companies in return for a hefty sack of cash. That’s an extremely short-term solution, and only really half of one – the open-source community is also akin to a hydra: for every head you remove, two more spring up in its place, and then six months or a year down the line, you’re being out-developed again.

There’s the potential to sue over IP rights, but that’s practically impossible and highly frustrating – only Meta would realistically have a claim, and it could easily be argued that Meta gains more from association with the ways the open-source community has improved generative AI on the basis of its foundation model than it would from restricting use of that model to a monetized version and a relatively hard-won user base.

Besides which, since the original leak, there are probably already a thousand “children” of the original model, all of which are significantly different enough from the parent, leaked version to warrant an individual identity. At which point, the only people getting rich are the lawyers.

The future, or something like it.

The most likely result is that the tech giants will have to grin and bear it. But for those predicting the end of the proprietary world in terms of generative AI, there’s bad news, too.

The market will likely settle and stratify, in much the same way as it has done in relation to other business tools – you have your Microsoft 365s, your Google Workspaces… and then you have a host of others that do similar things, but probably, when all is said and done, better. Less well known, and with less famous support networks in the event of anything going wrong, but out there and thriving, developing faster and in more bespoke ways than the behemoths can match. And cheaper. Always, always cheaper.

In terms of generative AI, the difference between the strata is likely to be more extreme and noticeable – at least until the giants begin aggressively copying the open-sourcers in providing smaller, lighter, more agile generative AI setups that can be customized and trained easily by the client with bespoke datasets relevant to their needs. (It might also be the case that the value of datasets rockets in response to these developments – like getting a cheap, fast, efficient games console, only for the price of the games to go up).

By which time, the open-source community is likely to have launched and grown a thriving market in exactly that kind of more personalized AI product, and established significant amounts of customer loyalty as a result.

The open-source invasion of generative AI is not, as such, the end of the world for the proprietary tech giants and their AI investments. But it does mean a relative democratization of the technology, which will strike a very great number of businesses – not to mention enthusiastic individuals – as an extremely attractive alternative to paying big business prices for less agile models.

The post What might open-source generative AI mean for proprietary software? appeared first on TechHQ.

Open-source coders have generative AI now – and it could change everything https://techhq.com/2023/05/open-source-coders-have-generative-ai-now-and-it-could-change-everything/ Mon, 15 May 2023 20:03:41 +0000 https://techhq.com/?p=224690

Just six months ago, on November 30th, 2022, OpenAI, backed by Microsoft, dropped a bomb on the tech world, in the form of ChatGPT. Since then, the tech industry has lost its collective mind and invested everything up to and including the family silver in generative AI – the new big prize, the new wundertool, the revolutionary technology that would change the world.

And there’s little doubt that it has, or that it will continue to do so. Every company under the sun has found some use for generative AI.

Google, outflanked by the OpenAI/Microsoft launch, burned its year’s supply of midnight oil to get its competitor, Bard, to the world in something that could just about be seriously considered good time. And a new technological arms race was declared, to become kings of generative AI.

The drag factors.

Except what also happened was that ChatGPT, GPT-4, Bard and others, ran into significant issues. Their lack of an objective truth model and the sheer size of their data libraries made them prone to convincing error. Open letters were written by the great and the self-aggrandizing, demanding a pause in the development of the technology. Italy raised legitimate concerns over data privacy. China had a puritanical hissy fit about generative AI trained on anything other than solidly socialist models.

And while companies all over the world, and at every scale, set about integrating large language model generative AI into their business practices, quietly in March, Meta’s new LLaMA platform was leaked to the open-source community.

It’s probably worth a refresher course in what happens when the open-source community gets its hands on a new toy.

The short answer is “practically everything useful you think is developed by major tech giants.”

And now, a document that purports to be a leaked internal memo from Google is painting an alarming picture for the tech giants – and an extremely attractive one for companies and people who want generative AI to do specific things, and who don’t necessarily want to pay tech giant bucks to get it done.

The flavor of the memo is perhaps conveyed in an early line. “While we’ve been squabbling, a third faction has been quietly eating our lunch. I’m talking, of course, about open-source. Plainly put, they are lapping us.”

The open-source army.

There’s a certain irrevocable logic to this. You can lock a thousand coders and programmers in a basement in OpenAI or Google HQ and tell them to be creative or the puppy gets it. They’ll produce impressive things, to be sure.

But the open-source community is millions, if not billions strong. And they work independently and in teams to solve problems. To smooth out bugs. To build cute new things that nobody ever knew they needed. The open-source community is largely responsible for everything that works on the internet. Get that community a large language model, and it will outperform you every time, however many millions of dollars you pour into R&D in tech giants. Bottom line, the open-source community is a stable quantum computer to your 16k 1980s IBM machine – and the floppy disc it rode in on.

And the open-source community is doing precisely what the open-source community does, on the basis of Meta’s LLaMA platform. Not Google’s Bard, and not OpenAI’s ChatGPT.

The memo continues, listing things that the big companies regard as “major open problems” – which the open-source community has already solved and put into people’s hands. Today. And pretty much for free, rather than behind a paywall designed to claw back a vast amount of research investment.

Tomorrow’s capabilities – today.

For instance, the memo highlights that the open-source crowd has already cracked puzzles like:

  • “LLMs on a phone.
  • Scalable personal AI.
  • Responsible release: This one isn’t ‘solved’ so much as ‘obviated.’
  • Multimodality: A current multimodal ScienceQA SOTA was trained in an hour.”

What’s more, the memo, which purports to be from a Google staffer, starkly points out that while the big models still hold a slight edge in terms of quality, that gap is closing with astonishing rapidity. Six weeks from now? Six months?

“Open-source models are faster, more customizable, more private, and pound-for-pound more capable. They are doing things with $100 and 13B params that we struggle with at $10m and 540B. And they are doing so in weeks, not months. This has profound implications for us.”

Too true. Without quoting too freely from the document, it starkly predicts:

  • “We have no secret sauce. Our best hope is to learn from and collaborate with what others are doing outside Google. We should prioritize enabling 3P integrations.
  • People will not pay for a restricted model when free, unrestricted alternatives are comparable in quality. We should consider where our value add really is.
  • Giant models are slowing us down. In the long run, the best models are the ones which can be iterated upon quickly. We should make small variants more than an afterthought, now that we know what is possible in the <20B parameter regime.”

The breakdown.

What does all this actually mean?

Essentially, smaller, more agile, more project-specific versions of generative AI that can, for instance, be run on a handful of Threadrippers, rather than on the massively power-hungry compute currently behind the likes of ChatGPT and Bard.

Equally essentially, a free-to-access version of generative AI that you can quickly and easily personalize with the data you actually need it to train on, rather than all data everywhere, as has tended to be the way with the techno-giant models. Less extraneous data and less clunkiness – without the tech giant price tag.

If you want a buzz-phrase for the impact of the open-source community and its intensive play, it’s easy to find – it represents the potential democratization of generative AI.

It’s almost ironic that the tech giants didn’t especially see this development coming, because it’s not as though the open-source community doesn’t have a record when it comes to taking things and finding infinitely better, smoother, faster ways of getting them done.

In particular in this instance, analysts are citing the use of a cheap and easy fine-tuning method known as LoRA (low-rank adaptation), and a couple of nifty developments that allowed for breakthroughs in scaling – in particular, Chinchilla.
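For the curious, a LoRA fine-tune of an open LLaMA-style model is short enough to sketch with Hugging Face’s peft library – the base model name, target modules and hyperparameters below are illustrative assumptions of ours, not the settings used by any project mentioned in the memo:

```python
# A minimal sketch of a LoRA fine-tuning setup with Hugging Face's peft library.
# Model name, target modules and hyperparameters are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "openlm-research/open_llama_3b"  # any LLaMA-style causal LM would do
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# LoRA freezes the original weights and injects small, trainable low-rank
# matrices into selected attention projections, so only a tiny fraction of
# the parameters needs updating -- which is what makes it cheap.
config = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,                        # scaling applied to the update
    target_modules=["q_proj", "v_proj"],  # which projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # typically well under 1% of the full model
```

Training then proceeds as usual, against whatever dataset you care about, but with orders of magnitude fewer trainable weights.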

Whether the admissions and acknowledgments in the memo turn out to actually be from a Google staffer or not, the open-source community’s work on generative AI models feels like a whole new breakthrough in making the technology available, and personalized, and target-specific.

And that might yet be how you build a technological revolution.

However the big players officially respond to the likes of OpenLLaMA – yes, there’s already an open-source clone of Meta’s original – joining the market, one thing seems certain.

Things just got interesting.

Again.

 

This article was created with reference to the text of the “leaked memo” found on the Semianalysis website, with our thanks. The caveats regarding the memo’s contents on the Semianalysis page should be considered to also apply to this article.

The post Open-source coders have generative AI now – and it could change everything appeared first on TechHQ.

ChatGPT bug exposes Redis vulnerability issue https://techhq.com/2023/03/chatgpt-bug-exposes-redis-vulnerablity-issue/ Wed, 29 Mar 2023 16:09:29 +0000 https://techhq.com/?p=222601

When ChatGPT was first released in November 2022, there were concerns in some quarters that the advanced chatbot, which had been trained on text scraped from the internet, could be used to write malware. The threat model was that bad actors no longer needed advanced programming skills to write code capable of tricking victims into handing over personally identifiable information (PII). Instead, adversaries could simply prompt ChatGPT with suitable keywords and copy and paste the output, rather than having to puzzle out the programming from scratch. But it turns out that a ChatGPT bug made gathering PII easier still.

Not all cybersecurity experts share the same concerns about the dangers of ChatGPT being used by bad actors to write malware. Threat actors already distribute code and conduct cyberattacks in return for payment – an activity that’s dubbed Malware-as-a-Service (MaaS). And so, the additional cybersecurity risk of ChatGPT is debatable. But that’s not to say that OpenAI’s code is risk-free, as CVE-2023-28858 and CVE-2023-28859 highlight.

Earlier this month, ChatGPT users reported that details being shown in their chat history bar weren’t their own. Generative AI is all about creating text and images based on prompts, but that creativity shouldn’t spill over into subscriber data. The unusual behavior extended to displaying the names, email addresses, postal addresses, and even partial credit card numbers of other subscribers in user account page placeholders.

Not my number

Users upgrading from OpenAI’s free research preview of ChatGPT to a paid-for ChatGPT Plus version reported that validation code requests contained telephone numbers and email addresses that they didn’t recognize. And the reason for this confusion? A programming error known as a race condition, where rather than data being served in a logical, predictable manner, processes compete for resources in an uncoordinated and unpredictable way.

Race conditions can cause programs to crash as code is fed with unexpected or incorrect results. But, depending on the error handling, apps may continue running and treat the erroneous output as genuine. And this appears to be the case for OpenAI’s implementation of its ChatGPT web UI.
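To make the idea concrete, here is a toy Python example – nothing to do with OpenAI’s actual code – in which two threads race over a shared counter and the program carries on, happily reporting a wrong but plausible-looking number:

```python
# A toy race condition: two threads update a shared counter without any
# coordination, so read-modify-write cycles overlap and updates are lost.
import threading

counter = 0

def bump(times: int) -> None:
    global counter
    for _ in range(times):
        current = counter      # read...
        counter = current + 1  # ...then write, racing the other thread

threads = [threading.Thread(target=bump, args=(100_000,)) for _ in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()

# We "expect" 200000, but the total will usually come up short -- and the
# program treats the erroneous result as genuine, exactly as described above.
print(counter)
```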

“We took ChatGPT offline earlier this week due to a bug in an open-source library which allowed some users to see titles from another active user’s chat history,” wrote OpenAI in a blog post explaining the ChatGPT outage that occurred on 20 March 2023. “It’s also possible that the first message of a newly-created conversation was visible in someone else’s chat history if both users were active around the same time.”

OpenAI’s tech team traced the race condition to its deployment of Redis – a popular open-source in-memory data store – which ChatGPT uses to cache user information. Redis allows developers to dramatically speed up database queries, API calls, and other common transactions between nodes. And it’s highly scalable. OpenAI uses Redis Cluster to distribute session details over multiple Redis instances, and then coordinates source information held on its main database using the redis-py library.
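As a rough illustration of how such a cache is normally used, here is a generic cache-aside sketch with redis-py’s async client – the key names, expiry time and db_lookup helper are our own assumptions, not OpenAI’s code:

```python
# A generic cache-aside pattern with redis-py's asyncio client (illustrative
# only). Session data is served from Redis when present, and fetched from the
# primary database -- then cached -- on a miss.
import json
import redis.asyncio as redis

async def load_session(r: redis.Redis, session_id: str, db_lookup) -> dict:
    key = f"session:{session_id}"          # hypothetical key scheme
    cached = await r.get(key)
    if cached is not None:
        return json.loads(cached)          # cache hit: no database round-trip
    session = await db_lookup(session_id)  # cache miss: ask the main database
    await r.set(key, json.dumps(session), ex=300)  # keep it warm for 5 minutes
    return session
```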

Multiprocessing glitch

Information held in OpenAI’s database propagates across to the Redis environment. And requests and responses are managed in a cooperative multitasking fashion thanks to Async IO – a concurrent programming design supported in Python. Connections between the database server and Redis cluster exist as a shared pool, with incoming and outgoing queues. Ordinarily, the system works fine, but an issue can occur if a request is canceled after it has been pushed onto the incoming queue, but before the response has left as part of the outgoing sequence of information.

Typically, these canceled requests result in an ‘unrecoverable server error’, and users have to resubmit their request. But not always. The routine will treat the returned data as valid if the corrupted value happens to be of the same data type as the response the requester was expecting – even if it belongs to another user – as the makers of ChatGPT discovered. Adding to the drama, OpenAI’s coders had introduced a change (on 20 March 2023) that caused Redis request cancellations to spike. And with more cancellations, there were more chances that the data types would match.
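A heavily simplified sketch of that failure mode – ours, not OpenAI’s or redis-py’s code – looks something like this: one shared connection, one cancelled request, and the next caller reads a reply that was never meant for them:

```python
# Toy model of a shared connection whose replies come back in order: if a
# request is cancelled after its command went out but before its reply was
# read, the orphaned reply is handed to whoever asks next.
import asyncio

class SharedConnection:
    def __init__(self):
        self.commands = asyncio.Queue()   # outgoing commands
        self.replies = asyncio.Queue()    # incoming replies, strictly FIFO

    async def server(self):
        # Stand-in for the Redis server: answers every command, in order,
        # after a little simulated network latency.
        while True:
            cmd = await self.commands.get()
            await asyncio.sleep(0.01)
            await self.replies.put(f"reply-to:{cmd}")

async def get_profile(conn, user):
    await conn.commands.put(f"GET profile:{user}")
    return await conn.replies.get()       # takes the *next* reply, whoever it belongs to

async def main():
    conn = SharedConnection()
    server = asyncio.create_task(conn.server())

    # User A's request is cancelled after the command was sent but before the
    # reply arrived -- the reply stays queued on the shared connection.
    task_a = asyncio.create_task(get_profile(conn, "alice"))
    await asyncio.sleep(0.001)
    task_a.cancel()

    # User B now receives Alice's stale reply: right shape, wrong owner.
    print(await get_profile(conn, "bob"))  # -> reply-to:GET profile:alice
    server.cancel()

asyncio.run(main())
```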

OpenAI believes that 1.2% of its ChatGPT Plus subscribers who were active during a specific nine-hour window – between 01:00 hrs and 10:00 hrs Pacific Time on the day that the Redis request cancellations spiked – could have been affected. OpenAI notes that the bug only appeared in the Async IO redis-py client for Redis Cluster, which could explain why developers who had implemented other parallel processing schemes may not have observed the same vulnerability.

According to the blog post, OpenAI has reached out to the Redis maintainers with a patch to resolve the issue, although a write-up on the topic by Sonatype security researcher Ax Sharma says that testers were able to reproduce the flaw after the fix. However, ChatGPT users can sleep a little easier in the knowledge that OpenAI has added redundant checks to ensure the data returned by its Redis cache matches the user requesting the information.
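That redundant check is conceptually simple; a sketch of the idea (the field names and key scheme are our own assumptions, not OpenAI’s implementation) might look like this:

```python
# Defensive ownership check on cached data (illustrative only): if the cached
# record claims to belong to a different user, discard it rather than serve it.
import json
import redis.asyncio as redis

async def get_cached_profile(r: redis.Redis, user_id: str):
    raw = await r.get(f"profile:{user_id}")   # hypothetical key scheme
    if raw is None:
        return None                           # cache miss: caller hits the database
    payload = json.loads(raw)
    if payload.get("user_id") != user_id:
        # Cache entry does not match the requester -- treat it as poisoned.
        await r.delete(f"profile:{user_id}")
        return None
    return payload
```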

Ironically, when ChatGPT first went live, developers were celebrating the advanced chatbot’s ability to find bugs in code. And while a number of static code analysis tools exist that can help to identify potentially risky threading schedules, race conditions are timing-dependent and may only surface in dynamic testing. Microsoft lists a number of tools and techniques for identifying concurrency issues, but ideally apps will be designed to avoid conflicting events occurring simultaneously, even if that chance is believed to be extremely small.

The post ChatGPT bug exposes Redis vulnerablity issue appeared first on TechHQ.
