PolyU research finds that improving the training of AI large language models helps them align better with human brain activity

HONG KONG, May 27, 2024 /PRNewswire/ -- With generative artificial intelligence (GenAI) transforming the social interaction landscape in recent years, large language models (LLMs), which use deep-learning algorithms to train GenAI platforms to process language, have been put in the spotlight. A recent study by The Hong Kong Polytechnic University (PolyU) found that LLMs perform more like the human brain when they are trained in ways more similar to how humans process language, offering important insights for brain studies and for the development of AI models.

Current LLMs mostly rely on a single type of pretraining: contextual word prediction. This simple learning strategy has achieved surprising success when combined with massive training data and model parameters, as shown by popular LLMs such as ChatGPT. Recent studies also suggest that word prediction in LLMs can serve as a plausible model of how humans process language. However, humans do not simply predict the next word; they also integrate high-level information in natural language comprehension.
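
To make the pretraining objective concrete, below is a minimal sketch of contextual word prediction: a model is trained to predict each token from the tokens before it. The toy vocabulary, the tiny sizes and the LSTM stand-in (rather than a full transformer) are illustrative assumptions, not the models used in the PolyU study.

```python
# Minimal sketch of the next-word-prediction objective behind causal LLMs.
# All sizes and the LSTM encoder are toy placeholders (assumptions).
import torch
import torch.nn as nn

vocab_size, d_model = 100, 32
embed = nn.Embedding(vocab_size, d_model)
encoder = nn.LSTM(d_model, d_model, batch_first=True)  # stand-in for a transformer
head = nn.Linear(d_model, vocab_size)

tokens = torch.randint(0, vocab_size, (1, 16))     # a toy token sequence
inputs, targets = tokens[:, :-1], tokens[:, 1:]    # predict token t+1 from tokens up to t

hidden, _ = encoder(embed(inputs))
logits = head(hidden)                              # (batch, seq, vocab)
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), targets.reshape(-1)
)
loss.backward()                                    # gradients for one training step
```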

A research team led by Prof. Li Ping, Dean of the Faculty of Humanities and Sin Wai Kin Foundation Professor in Humanities and Technology at PolyU, incorporated the next sentence prediction (NSP) task into model pretraining. NSP simulates a central process of discourse-level comprehension in the human brain: evaluating whether a pair of sentences is coherent. The team then examined the correlation between the models' data and brain activation. The study was recently published in the academic journal Science Advances.
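
For intuition, an NSP objective can be sketched as a binary classifier over a pair of sentence encodings: does sentence B coherently follow sentence A? The encoder, sizes and random inputs below are placeholders, not the study's actual architecture.

```python
# Hedged sketch of a next-sentence-prediction (NSP) head.
# Sentence encodings here are random stand-ins (assumptions).
import torch
import torch.nn as nn

d_model = 32
nsp_head = nn.Linear(2 * d_model, 2)   # logits for {not-next, is-next}

enc_a = torch.randn(1, d_model)        # stand-in encoding of sentence A
enc_b = torch.randn(1, d_model)        # stand-in encoding of sentence B

logits = nsp_head(torch.cat([enc_a, enc_b], dim=-1))
label = torch.tensor([1])              # 1 = B actually follows A in the corpus
loss = nn.functional.cross_entropy(logits, label)
loss.backward()
```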

The research team trained two models, one with NSP enhancement and one without; both also learned word prediction. Functional magnetic resonance imaging (fMRI) data were collected from people reading connected or disconnected sentences. The team then examined how closely the patterns from each model matched the brain activation patterns in the fMRI data.
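
One common way such model-to-brain comparisons are done is representational similarity analysis (RSA), sketched below: compute pairwise dissimilarities between sentences in model space and in brain space, then correlate the two geometries. This is a generic illustration under assumed random data; the paper's exact analysis pipeline may differ.

```python
# Hedged RSA sketch: does the model's representational geometry resemble
# the brain's? Arrays are random placeholders (assumptions).
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

n_sentences = 20
model_feats = np.random.randn(n_sentences, 32)    # model activations per sentence
brain_feats = np.random.randn(n_sentences, 500)   # fMRI voxel patterns per sentence

# Dissimilarity between every pair of sentences, in each space
model_rdm = pdist(model_feats, metric="correlation")
brain_rdm = pdist(brain_feats, metric="correlation")

# Higher rank correlation = model geometry closer to brain geometry
rho, p = spearmanr(model_rdm, brain_rdm)
print(f"model-brain RSA: rho={rho:.3f}, p={p:.3f}")
```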

Training with NSP provided clear benefits. The model with NSP matched human brain activity in multiple areas much better than the model trained only on word prediction, and its mechanism maps nicely onto established neural models of human discourse comprehension. The results offer new insights into how our brains process full discourse, such as conversations. For example, parts of the right side of the brain, not just the left, were involved in understanding longer discourse. The model trained with NSP could also better predict how fast someone read, showing that simulating discourse comprehension through NSP helps AI understand humans better.
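
The reading-speed claim amounts to a predictive regression: model-derived features of each sentence are used to predict human reading times, evaluated on held-out data. The sketch below illustrates this setup with placeholder features and data; it is an assumed, generic pipeline, not the study's actual analysis.

```python
# Hedged sketch: regress reading times on model-derived sentence features
# and score predictions on held-out folds. Data are random placeholders.
import numpy as np
from sklearn.linear_model import RidgeCV
from sklearn.model_selection import cross_val_score

n_sentences = 100
features = np.random.randn(n_sentences, 10)    # e.g. surprisal, NSP coherence score (assumed)
reading_times = np.random.randn(n_sentences)   # per-sentence reading times, z-scored

model = RidgeCV(alphas=np.logspace(-3, 3, 13))
scores = cross_val_score(model, features, reading_times, cv=5, scoring="r2")
print(f"held-out R^2: {scores.mean():.3f}")    # better features -> higher R^2
```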

Recent LLMs, including ChatGPT, have relied on vastly increasing the training data and model size to achieve better performance. Prof. Li Ping said, "There are limitations in just relying on such scaling. Advances should also be aimed at making the models more efficient, relying on less rather than more data. Our findings suggest that diverse learning tasks such as NSP can improve LLMs to be more human-like and potentially closer to human intelligence."

He added, "More importantly, the findings show how neurocognitive researchers can leverage LLMs to study higher-level language mechanisms of our brain. They also promote interaction and collaboration between researchers in the fields of AI and neurocognition, which will lead to future studies on AI-informed brain studies as well as brain-inspired AI."

Media Contact
Ms Annie Wong
Senior Manager, Public Affairs
Tel: +852 3400 3853
Email: anniewy.wong@polyu.edu.hk 

source: The Hong Kong Polytechnic University
