Connect with us

Technology

Apple says it took a ‘responsible’ approach to training its Apple Intelligence models

Published

on

In this photo illustration, the

Apple published technical paper detailing the models developed for Apple Intelligence, a range of generative AI features coming to iOS, macOS, and iPadOS over the subsequent few months.

In the article, Apple opposes accusations that it took an ethically questionable approach to training a few of its models, reiterating that it didn’t use private user data but as a substitute relied on a combination of knowledge publicly available and licensed to Apple Intelligence.

“(The) pre-training dataset consists of… data we have licensed from publishers, curated publicly available or open datasets, and publicly available information crawled by our web crawler, Applebot,” Apple writes within the article. “Given our focus on protecting user privacy, we note that no private Apple user data is included in the data mix.”

Proof News in July reported that Apple used a dataset called The Pile, which comprises captions from a whole lot of 1000’s of YouTube videos, to train a family of models designed for on-device processing. Many YouTube creators whose captions were wolfed up by The Pile were unaware of this and didn’t consent to it; Apple later issued a statement saying it had no intention of using the models to power any AI features in its products.

A technical paper that offers a sneak peek on the models that Apple first unveiled at WWDC 2024 in June, titled Apple Foundation Models (AFM), emphasizes that the training data for the AFM models was acquired in a “responsible” manner — or at the least responsibly by Apple’s definition.

The training data for the AFM models includes publicly available Internet data, in addition to licensed data from undisclosed publishers. According to The New York Times, Apple I contacted several publishers in late 2023, including NBC, Condé Nast, and IAC, with multi-year deals price at the least $50 million to train models on publishers’ news archives. Apple’s AFM models were also trained on open-source code hosted on GitHub, specifically Swift, Python, C, Objective-C, C++, JavaScript, Java, and Go code.

Training models on code without permission, even open source, is a point of contention amongst developers. Some developers have argued that some open-source code bases are unlicensed or don’t allow AI training of their terms of use. However, Apple says it has “licensed” the code to try to include only repositories with minimal usage restrictions, reminiscent of those licensed under the MIT, ISC, or Apache licenses.

To boost the mathematical skills of the AFM models, Apple specifically included math questions and answers from web sites, math forums, blogs, tutorials, and seminars within the training set, according to the article. The company also used “high-quality, publicly available” data sets (which the article doesn’t specify) with “licenses that allow use to train… models,” filtered to remove sensitive information.

In total, the training dataset for the AFM models weighs in at about 6.3 trillion tokens. (Tokens are small pieces of knowledge which are typically easier for generative AI models to digest.) By comparison, that’s lower than half the variety of tokens — 15 trillion — that Meta used to train its flagship text-generating model, Llama 3.1 405B.

Apple acquired additional data, including human and artificial data, to refine the AFM models and attempt to mitigate any undesirable behaviors reminiscent of toxicity release.

“Our models are designed to help users perform on a regular basis tasks on Apple products in a way that’s well-established
in Apple’s core values ​​and rooted in our principles of responsible AI at every stage,” the corporate said.

There is not any hard evidence or shocking insights within the article, and that is due to its careful design. Rarely are such articles very revealing, due to pressures of competition, but in addition because revealing much of the data could get corporations into legal trouble.

Some corporations that train models by scraping public web data claim that their practice is protected by fair use doctrine. But that is a difficulty that is extremely controversial and the topic of a growing variety of lawsuits.

Apple notes within the article that it allows webmasters to block the crawler from scraping their data. But that puts individual creators in a difficult position. What’s an artist to do if, for instance, their portfolio is hosted on a site that refuses to block Apple from scraping their data?

Court battles will resolve the fate of generative AI models and the way they’re trained. For now, though, Apple is trying to position itself as an ethical player while avoiding unwanted legal scrutiny.

This article was originally published on : techcrunch.com
Continue Reading
Advertisement
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Technology

Is Your City a Sunset City? This Interactive Map Will Tell You –

Published

on

By

sundown. town m ap

Tougaloo College’s interactive map permits you to explore cities and communities across the United States, and discover in the event that they are considered sunset cities.


Some interactive map from Tougaloo College means that you can study cities and communities within the United States and discover in the event that they are considered sunset cities.

According to Britannicasunset in US history is a city that excluded people of color, most frequently African Americans, when the sun went down. The way people enforced these “rules” ranged from collective violence similar to public lynchings, discriminatory laws, and discrimination in open housing.

The map is inspired by a database introduced by the late historian and sociologist James W. Loewen. He is the writer of the classic bestseller. Tougaloo’s History and Social Justice section has included what describes as “the only cities in the world with a twilight register.”

“Sunset Town isn’t just a place where something racist happened,” researchers write on the database’s website. “It’s an entire community (and even a county) that was intentionally ‘all white’ for decades.”

The researchers add that “All white” is in quotation marks because some towns historically “allowed one black family to remain while expelling the rest.” They also indicate that some sunset towns also barred Chinese, Jews, Mexicans, Native Americans, and in some cases, Mormons.

How to read a map

For the interactive map, users should hover over a state to see an alphabetical list. The map key consists of six colours used as dots to discover cities which are definitely, probably, possibly, or unlikely to be sunset cities.

Black dots indicate black cities or municipalities. Places on the interactive map with a red flag indicate places of special importance.

On at first glance, the Midwest and The Plains region seems to have more dots indicating definite, probable, and possible sunset cities. The map shows that there just isn’t a single state in America that doesn’t have a suspected sunset city.

As for black cities and towns, the ten listed are Pembroke and Brooklyn, Illinois. Expose, Mound Bayou, New Africa, Renova, and Winstonville, all in Mississippi. Maryville, South Carolina; Martinsville, Indiana; and North Amityville, New York.


This article was originally published on : www.blackenterprise.com
Continue Reading

Technology

Adam Neumann’s Flow Startup Launches Co-Living Community in Saudi Arabia

Published

on

By

Flow, Adam Neumann’s co-living startup, has opened a 238-apartment complex in the Saudi capital of Riyadh, Forbes has some details. The opening included an Aztec-style hot chocolate ceremony and bags reading “holy s— I live.” Rent for furnished units starts at $3,500 a month and includes hotel services like laundry and housekeeping, in addition to amenities like swimming pools, coed gyms (unusual in Saudi Arabia) and bowling alleys. Flow is constructing three other properties with almost 1,000 apartments in Riyadh.

The company’s first, less luxurious properties opened in April in Fort Lauderdale and Miami.

Flow raised $350 million from Andreessen Horowitz in 2022. The funding raised questions given the troubled history of Neumann’s previous startup, WeWork. Once valued at $47 billion, WeWork filed for bankruptcy protection last yr and was eventually acquired by Yardi, an actual estate group, for $450 million.

This article was originally published on : techcrunch.com
Continue Reading

Technology

Elon Musk Threatened with SEC Sanctions for Failure to Appear in Court

Published

on

By

Elon Musk threatened with SEC sanctions for failing to appear in court

Elon Musk, CEO of X and other firms whose names include the letter “X,” found himself in the crosshairs of regulators after he failed to testify this month as a part of an investigation into Musk’s acquisition of Twitter.

In a document filed today, the U.S. Securities and Exchange Commission (SEC) said it intends to impose sanctions on Musk after he missed a court-ordered hearing in Los Angeles Superior Court on September 10. According to the document, Musk didn’t notify the SEC that he wouldn’t appear for the hearing until three hours before the hearing was set to begin.

“The court must make clear that Musk must stop his games and delaying tactics,” the letter reads.

According to the documents, Musk spent September 10 overseeing the launch of Polaris Dawn, a spacecraft manufactured by his space exploration company, SpaceX.

SEC counsel proposed rescheduling Musk’s hearing for the following day, September 11. However, Musk’s lawyer declined, agreeing only to an October hearing.

The SEC is searching for “significant contingent relief” if Musk fails to appear in court in October. The agency has also indicated it plans to file a motion for sanctions against Musk to get well travel expenses for the canceled testimony and other relief. (In the lawsuit, the SEC said it spent “thousands of dollars” to fly three attorneys to Los Angeles for the Sept. 10 hearing.)

Musk’s court-ordered appearance stems from an SEC investigation into whether the billionaire acted lawfully in disclosing his Twitter stock purchases ahead of his $44 billion acquisition of the corporate in 2022. The investigation can also be looking into whether Musk’s statements in regards to the transactions were misleading; the SEC alleges that Musk waited at the least 10 days too long to disclose that he was buying Twitter stock.

The investigation is the second time Musk has found himself under the SEC’s gun in recent years. In 2018, the agency ordered Musk to step down as Tesla CEO and pay $40 million for tweets about Tesla stock that the SEC found amounted to market manipulation. At the time, Musk called the fraud allegations “unjustified.”

The Securities and Exchange Commission (SEC) also investigated Musk and Tesla over claims about Tesla’s vehicles’ ability to achieve “full autonomous driving” in addition to Tesla’s use of company funds to construct a “glass house” for Musk.

The full text of the appliance will be read below.

JOINT STATEMENT ON THE R… By SP-TechCrunch

This article was originally published on : techcrunch.com
Continue Reading
Advertisement

OUR NEWSLETTER

Subscribe Us To Receive Our Latest News Directly In Your Inbox!

We don’t spam! Read our privacy policy for more info.

Trending