AI training is expensive: Only technology “giants” can afford it?

AI training is expensive: Only technology “giants” can afford it?

Xuan Chinh Nguyen

23:34 01/06/2024

2 phút đọc

Researchers say data is key to creating more intelligent and capable AI systems. The article takes the example of two text generation models, Llama 3 from Meta and OLMo from the Allen Institute for Artificial Intelligence (AI2) to illustrate. Although it has almost the same structure, Llama 3 is trained on a larger amount of data so it performs better.

Photo source: GettyImages

However, data quality is just as important as quantity. AI models operate based on the principle of “garbage in, garbage out”, so filtering and checking data quality is necessary.

Data racing can lead to problems. Experts fear that the focus on big and high-quality data will turn AI development into the monopoly of a few companies with big budgets. They can monopolize data sets and stifle innovation by others.

Additionally, data collection is sometimes not transparent. Some AI companies have pulled data from sources such as YouTube videos, Google Maps reviews without asking permission from content owners or creators. Some companies are even considering using copyright-protected content to train their models.

Another problem is the use of cheap labor in developing countries to label training data. These people are paid low wages and are exposed to violent content for long periods of time without benefits.

Commercial data transactions are also not entirely fair. OpenAI has spent hundreds of millions of dollars to buy content rights, far exceeding the budgets of most research groups, non-profit organizations and startups.

With the AI training data market expected to grow strongly, data platforms are charging higher fees. This hurts the AI research community as a whole because smaller groups cannot afford it.

However, there are some independent efforts to make open data sets free for everyone. EleutherAI, a nonprofit research group, is collaborating with the University of Toronto and other institutions to build The Pile v2, a suite of billions of text snippets.

The question is whether these efforts can keep up with major technology corporations. If data collection and testing still depends on financial resources, the answer is likely no, at least until there is a research breakthrough that levels the playing field.

Chia sẻ bài viết:

Từ khoá:

AI

Có thể bạn sẽ thích

Nhận xét (0)

Đánh giá ngay

Bài viết liên quan

Xuan Chinh Nguyen

09:00 20/12/2024

Palm Mini 2 Ultra: Máy tính bảng mini cho game thủ

Alldocube, thương hiệu nổi tiếng với các dòng máy tính bảng đa dạng trên Amazon, vừa công bố sản phẩm mới Palm Mini 2 Ultra. Với cấu hình mạnh mẽ, màn hình sắc nét và hỗ trợ tay cầm chơi game, thiết bị này nhắm đến người dùng yêu thích game di động và giả […]

Xuan Chinh Nguyen

08:48 29/09/2024

Robot with smart grip

Scientists at the Swiss Federal University of Technology Lausanne (EPFL) have created a very unique invention: a robotic hand that can detach from the arm and move on its own to pick up objects. Imagined like a mechanical snail, this hand can crawl to any location within reach to perform the assigned task. A recent […]

Xuan Chinh Nguyen

23:41 26/05/2024

NASA’s goal of conquering the Sun

Our sun is the best-studied star in the universe. However, there is a big mystery that remains unsolved. The Sun’s surface is about 6,000 degrees Celsius, but its outermost layer of atmosphere, called the corona, is incredibly hotter – about 1 million degrees Celsius. The corona can be seen as a halo surrounding the shadow […]

Xuan Chinh Nguyen

13:21 19/05/2024

Apple launches a new feature that makes it easier to use your phone while sitting on vehicle

Have you ever experienced the feeling of nausea during motion sickness but still tried to focus on your phone screen to relieve yourself? Or do sudden sudden brakes or the feeling of swaying make you even more uncomfortable? That’s when your body is starting to “fight” with the conflict between vision and hearing, forcing you […]

Xuan Chinh Nguyen

00:33 15/05/2024

Google Photos launches smart search feature “Ask for photos”

Google Photos is about to have a very smart new feature called “Ask Photos”. Developed based on Gemini artificial intelligence (AI) created by Google itself, this feature is expected to launch later this summer. With “Ask Photos”, users can search through photo collections on Google Photos using natural language, instead of having to manually enter […]

Roku streams live MLB baseball games for free - Techlade

Xuan Chinh Nguyen

10:16 14/05/2024

Roku streams live MLB baseball games for free

You can watch Sunday’s Major League Baseball (MLB) games on Roku for free, starting with St. Louis Cardinals and Boston Red Sox on May 19. Matches will be streamed live on The Roku Channel – no registration or account required. To watch, you need to download The Roku Channel to your Roku device or TV. […]

Gun detection AI technology company uses Disney to successfully persuade New York - Techlade

Xuan Chinh Nguyen

09:52 14/05/2024

Gun detection AI technology company uses Disney to successfully persuade New York

When New York City Mayor Eric Adams first met with representatives from Evolv, the gun-detection AI company, he was given a list of locations where the scanner could be used, including hospitals, schools, and Times Square. University and Port Authority Bus Terminal. According to emails obtained by Wired, what appears to have convinced Adams was […]

Xuan Chinh Nguyen

01:01 11/05/2024

Hackers claim to have collected 49 million Dell customer addresses before the company discovered the breach

A hacker calling himself Menelik claimed he stole the data of 49 million Dell customers. Menelik claims to have illegally accessed an online Dell portal and stolen customer data, including home addresses, directly from Dell’s servers. Techlade has verified that a portion of the stolen data matches Dell customer records. On Thursday, Dell sent an […]

Xuan Chinh Nguyen

08:51 10/05/2024

Thai food delivery app Line Man Wongnai plans to IPO in Thailand and the US in 2025

Line Man Wongnai, Thailand’s leading on-demand food delivery app, is considering going public (IPO) in Thailand or the US in 2025, according to CEO and co-founder Yod Chinsupakul in an interview. Exclusively with Techlade. There has been no final decision on the exchange, but the possibility of dual listing in both Thailand and the US […]

Xuan Chinh Nguyen

08:33 10/05/2024

Google pioneered the development of the first social networking application for Android

Sara Beykpour, former senior product manager at Twitter and now co-founder of AI news startup Particle, shared her story in a podcast. Joining Twitter in 2009 as a tools engineer when the company had just 75 employees, Beykpour then moved to mobile, where she witnessed the explosion of third-party Twitter apps across platforms. like BlackBerry […]

Xuan Chinh Nguyen

08:20 10/05/2024

AI outperforms humans in gaming: Altera receives investment from Eric Schmidt

Intelligent gaming robots (AI) are coming and Altera, a new startup, is entering the game to build this new generation of AI. altera , the startup, just announced it has raised $9 million in a seed funding round. This capital call round exceeded registration and was co-led by two reputable investment funds: First Spark Ventures […]

Xuan Chinh Nguyen

01:15 10/05/2024

TikTok automatically labels AI content from platforms like DALL·E 3

TikTok begins automatically labeling AI-generated content from other platforms. Any video/photo posted to TikTok, if using a service like OpenAI’s DALL·E 3, will automatically have an “AI-generated” label to notify viewers. This video platform uses Content Credentials, a technology from the Content Provenance and Authenticity Alliance (C2PA) – an organization co-founded by Microsoft and Adobe. […]

Xuan Chinh Nguyen

00:23 10/05/2024

Dell’s data was hacked, revealing customers’ home address information

Today, Dell, a famous computer manufacturer, officially confirmed a data leak incident affecting customers’ personal information. According to the announcement, this incident involved the disclosure of the names and addresses of some Dell customers. According to information from Techlade and widely shared on social networks, Dell confirmed that it is investigating an “incident related to […]

Xuan Chinh Nguyen

00:12 10/05/2024

Cracking passwords using Brute Force takes more time, but don’t rejoice!

Although it takes longer to crack passwords than before using Brute Force, cybersecurity experts warn that this is not necessarily good news. The security level of a password depends directly on its length and composition, including numbers, letters and special characters. The shorter and simpler the password, the easier it is to crack in a […]

Xuan Chinh Nguyen

19:08 25/03/2024

US lawsuit against Apple: What will happen to iPhone and Android?

Commentary: The US government is asking Apple to expand access to the iPhone, just as smartphones could be entering the next phase of pivotal change. The eternal debate: iPhone or Android? Maybe you chose your side a long time ago and never looked back. A landmark antitrust lawsuit is taking aim at this, demanding that […]

Xuan Chinh Nguyen

20:29 23/03/2024

The UAE will likely help fund OpenAI’s self-produced chips

According to a report by the Financial Times, OpenAI’s ambition to develop its own semiconductor chips to power advanced AI models may receive support from the United Arab Emirates (UAE). The report said MGX – a state-backed conglomerate in Abu Dhabi – is in discussions to support OpenAI’s project to build an in-house AI chip. […]

Xuan Chinh Nguyen

20:19 23/03/2024

AI-composed blues music lacks human flair and rhythm

“Soul Of The Machine” may not sound like a real song at first glance. However, with the development of AI technology, the line between music created by humans and machines is increasingly blurred. This song was composed by Suno, an AI tool specializing in music creation developed by a startup of the same name. According […]

Xuan Chinh Nguyen

18:01 23/03/2024

iOS 17: iPhone is safer with anti-theft feature

In January, when Apple updated iOS 17.3, it brought a number of patches and new features to your iPhone, including the long-awaited collaborative playlist feature in Apple Music. Besides, this technology giant also equips the iPhone with a new security feature called Stolen Device Protection , aimed at protecting your data if the phone is […]

Xuan Chinh Nguyen

16:35 23/03/2024

Samsung launches 2024 OLED TV with the highlight of breakthrough anti-glare technology

Both Samsung and LG launched new OLED TVs this month. The prices of the two brands are quite similar, and small improvements may be the deciding factor for users to choose the type of TV that suits their needs. Samsung announced its 2024 TV prices on Wednesday, with the cheapest OLED model, the 55-inch S90D, […]

REGISTER

TODAY

Sign up to get the inside scoop on today's biggest stories in markets, technology delivered daily.

By clicking “Sign Up”, you accept our Terms of Service and Privacy Policy. You can opt out at any time.