Language models have become a key factor when it comes to creating the most thorough and accurate artificial intelligence possible. The new model developed by Microsoft and Nvidia is said to feature about 530 billion parameters and to be capable of exceptional accuracy, especially in reading comprehension and complex sentence formation.
Nvidia and Microsoft's Megatron-Turing Natural Language Generation model (MT-NLG) marks a new record for a language model. According to the tech firms, their model is the most powerful to date.
Thanks to its 530 billion parameters, it is said to outperform OpenAI's GPT-3 as well as Google's BERT. Specialized in natural language, it is able to understand texts, reason and make deductions to form a complete and precise sentence.
Language models are built around a statistical approach. Many methods exist; the n-gram model is the simplest way to illustrate the idea, although MT-NLG itself is a much larger transformer-based neural network.
During the learning phase, the model analyzes a large quantity of text to estimate the probability that a given word will 'fit' at a given point in a sentence.
The probability of a word sequence is the product of the probabilities of each word given the words that precede it. By choosing likely words one after another, a model can produce fluent and usually grammatical sentences.
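The product of probabilities described above can be sketched with a bigram model, the simplest n-gram case, in which each word depends only on the word before it. This is an illustration only, not the method used to build MT-NLG; the toy corpus, the `<s>` and `</s>` boundary markers and the function names are assumptions made for the example.

```python
from collections import Counter


def train_bigram(sentences):
    """Count how often each word follows each preceding word."""
    context_counts = Counter()
    bigram_counts = Counter()
    for sentence in sentences:
        # <s> and </s> mark the start and end of a sentence (assumed markers).
        words = ["<s>"] + sentence.lower().split() + ["</s>"]
        for prev, word in zip(words, words[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, word)] += 1
    return context_counts, bigram_counts


def sequence_probability(sentence, context_counts, bigram_counts):
    """Multiply P(word | previous word) over the whole sentence."""
    words = ["<s>"] + sentence.lower().split() + ["</s>"]
    prob = 1.0
    for prev, word in zip(words, words[1:]):
        if context_counts[prev] == 0:
            return 0.0  # the preceding word never appeared in training
        prob *= bigram_counts[(prev, word)] / context_counts[prev]
    return prob


# Toy training corpus (hypothetical data).
corpus = ["the cat sat", "the cat ran", "the dog sat"]
context_counts, bigram_counts = train_bigram(corpus)

p_seen = sequence_probability("the cat sat", context_counts, bigram_counts)
p_unseen = sequence_probability("the dog ran", context_counts, bigram_counts)
```

In this sketch a sentence containing a word pair never seen in training gets a probability of zero, which is one reason practical systems add smoothing or, like the large models discussed here, rely on neural networks instead of raw counts.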
Biased algorithms still an issue
With 530 billion parameters, the MT-NLG model is particularly sophisticated. In the field of machine learning, parameters are the values a model learns during training, and their number is often used as a rough measure of a model's capacity.
It has been repeatedly shown that models with a large number of parameters, trained on large datasets, tend to perform better and to produce more accurate, nuanced language.
These models are capable of summarizing books and texts and even writing poems.
To train MT-NLG, Microsoft and Nvidia created their own dataset of about 270 billion "tokens" from English-language websites.
In natural language processing, "tokens" are the smaller chunks, such as words or pieces of words, into which text is broken up so that a model can process it.
The websites included academic sources such as arXiv and PubMed, educational and reference sites such as Wikipedia and GitHub, as well as news articles and even messages on social networks.
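To make the idea of tokens concrete, here is a minimal sketch that splits a sentence into words and punctuation marks. It is an assumption for illustration only: production language models typically use learned subword schemes such as byte-pair encoding, and the function name `simple_tokenize` is hypothetical.

```python
import re


def simple_tokenize(text):
    """Split text into word tokens and single punctuation tokens."""
    # \w+ matches a run of letters or digits; [^\w\s] matches one punctuation mark.
    return re.findall(r"\w+|[^\w\s]", text)


tokens = simple_tokenize("The model reads text, then predicts.")
print(tokens)
# ['The', 'model', 'reads', 'text', ',', 'then', 'predicts', '.']
print(len(tokens))
# 8
```

Counting a dataset in tokens rather than in words or characters gives a measure that matches what the model actually consumes during training.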
As always with language models, the main problem with widespread, public use is bias in the algorithms.
The data used to train machine learning algorithms contain human stereotypes embedded in the texts.
Gender, racial, physical and religious biases are widely present in these models. And it is particularly difficult to remove these problems.
For Microsoft and Nvidia, this is one of the main challenges with such a model. Both companies say that the use of MT-NLG "must ensure that proper measures are put in place to mitigate and minimize potential harm to users."
Before fully benefiting from these revolutionary models, this issue needs to be tackled, and for the moment it seems far from resolved.
ETX Studio
Wed Oct 13 2021

Language patterns reach record highs, but questions remain. - ETX Studio
