Connect with us

AI Research

Selective identification of polyploid hepatocellular carcinomas with poor prognosis by artificial intelligence-based pathological image recognition

Published

on


Development of an AI-based image recognition model to estimate HCC ploidy

First, we constructed a model to evaluate the ploidy status of HCC using deep learning and CNN-based image classification. A total of 44 cases whose ploidy status had been determined by chromosome FISH in our previous study6 were used as the training data. The training set included 27 diploid and 17 polyploid HCC cases. After obtaining a whole-slide image of the HE-stained slide for each tumor, we selected three or more ROIs showing the representative pathological appearance of the tumor (Fig. 1a). Each ROI was divided into 2048×2048-pixel tiles, and the tiles were subdivided into 256 × 256 patches for input into the deep-learning algorithm (see Materials and Methods). Deep learning for tumor ploidy classification was performed by training 42,240 small-patch images. The models calculated the probability of tumor polyploidization in each 2048 × 2048-pixel tile, and the average value across all tiles for each tumor was defined as the polyploidy score for the tumor (Fig. 1b, Supplementary Fig. 4).

Fig. 1: Construction of AI models to determine ploidy status of HCC.

a Scheme for the construction of AI-based image recognition models for determining HCC ploidy. b Representative HE-stained images of ROIs in diploid and polyploid HCC. The probabilities of HCC polyploidization in the corresponding 2048 × 2048-pixel tiles are shown in a color map. Scale bar, 200 μm. c ROC curves and AUC values of representative AI models in cross-validation. The data for the other models are shown in Supplementary Fig. 5. d Evaluation and comparison of the constructed AI models.

We first used three CNN-based image classification models: DenseNet12115, ResNet50d16, and EfficientNet_B017. The validity of the models was assessed by analyzing their receiver operating characteristic (ROC) curves and areas under the curve (AUC)24 (Fig. 1c, Supplementary Fig. 5). Five-fold cross-validation revealed that all three models achieved high AUC values (0.998−1.0, Fig. 1d). For example, with the optimized cutoff value (0.457) of polyploidy score determined based on the ROC curve, the EfficientNet_B0-based model exhibited high accuracy, sensitivity, and specificity (0.977, 1.0, and 0.963, respectively, Fig. 1d, Supplementary Fig. 6). These findings indicate that CNN-based models can be used to evaluate the HCC ploidy status using pathological HE images.

AI-based image recognition successfully assessed HCC ploidy at a low calculation cost

The coloration of HE staining is known to vary due to factors such as fixation conditions and staining protocols, potentially affecting AI model performance25,26. To address this, we constructed a model using EfficientNetB0 on grayscale-converted images to minimize the impact of such variability. The constructed EficientNetB0_gray model showed a high AUC value (0.998), comparable to that of the original CNN-based models, suggesting that the cellular morphological information obtained from grayscale images was sufficient to evaluate HCC ploidy (Fig. 1c, d, Supplementary Fig. 6).

We also developed models using ViT-based architectures, which incur lower calculation costs than CNN-based image recognition. Two and one models were constructed using HIPT20 and DINO19, respectively, both of which enabled the scalability of ViT to large images via self-supervised learning (see Materials and Methods). These encoders were trained on TCGA or liver pathology images obtained at our institution. By freezing parts of the model during training, overfitting can be moderated, even when the labeled data are insufficient. In particular, because it allows for easy replacement of the first stage with other publicly available models trained on pathology images using self-supervised learning, model construction using HIPT requires a shorter learning time than CNN-based models. All three models exhibited high accuracy and AUC values that were comparable to those of the CNN models (Fig. 1d, Supplementary Fig. 6).

AI-based ploidy assessment identified polyploid HCC cases with poor prognosis within a large cohort

We examined whether our constructed AI models could properly assess HCC ploidy using a separate dataset. Tumor ploidy was determined using chromosome FISH in 38 new HCCs (Dataset 2) that were not included in the first dataset (Dataset 1). Their polyploidy scores were then calculated by analyzing their HE images using AI models. The sensitivity, specificity, and proportion of polyploid HCC were determined based on the cutoff values determined in the analysis of Dataset 1 (Fig. 1d, Supplementary Fig. 6). Among the models examined, some, including the two HIPT-based models, exhibited relatively high AUC values over 0.8 (Fig. 2a, b). The decrease in accuracy observed in Dataset 2 compared to Dataset 1 may be attributed to the fact that cases with typical histology of diploid and polyploid cancers were used for training in Dataset 1, while cases in Dataset 2 were selected in an unbiased manner.

Fig. 2: Validation of AI models in separate datasets.
figure 2

a, b Performance of AI models in the validation assessments. The ploidy statuses of 38 HCCs (determined by chromosome FISH) were compared with the ploidy statuses, as assessed by AI models. ROC curves of the representative AI models are shown in (a). c Prognostic stratification based on ploidy assessments by the AI models. A total of 169 HCCs were analyzed. d Kaplan–Meier curves of overall survival. Statistical difference was determined by log-rank test. The three AI models that identified a significant difference in prognosis between diploid and polyploid HCCs in (c) are shown.

To further evaluate the utility of AI-based polyploid HCC identification, a large cohort of 169 HCC cases (Dataset 3) was examined using AI models (Fig. 2c). In particular, the EfficientNet_B0-based and HIPT_unfrozen2 models diagnosed a number of polyploid HCC cases proportional to their prevalence, as shown in previous reports (36–38% 3,6). By identifying polyploidy in HCC, the EfficientNet_B0-based and HIPT_unfrozen2 models discriminated HCC patients with significantly worse overall survival after surgery (Fig. 2c, d, Supplementary Fig. 7). These findings indicate that AI models, especially the HIPT_unfrozen2 model, are useful for identifying polyploid HCC and predicting poor prognosis.

Analysis of a large cohort revealed the characteristics of polyploid HCC

The HIPT_unfrozen2 model, which exhibited the most optimal features for ploidy determination among the constructed models, was used to investigate the characteristics of polyploid HCC by analyzing a large cohort. In Dataset 3, consisting of 169 cases, 113 and 56 cases were diagnosed as diploid and polyploid HCC, respectively, using the HIPT_unfrozen2 model. As observed in other datasets, where no associations were found between tumor ploidy and age, sex, or body mass index (Supplementary Table 1), the two groups showed no significant differences in these variables (Table 1, Supplementary Fig. 8). Consistent with our previous results, serum alpha-fetoprotein (AFP) levels were significantly higher in polyploid HCC than in diploid HCC, whereas tumor size and stage were comparable between the two groups (Table 1, Fig. 3a). Polyploid HCC was also significantly associated with a high prevalence of poor differentiation and exhibited MTM or scirrhous structures (Table 1, Fig. 3b, c). Polyploid giant cancer cells (PGCCs), which exhibit a distinct appearance with prominently large nuclei or profound multinucleation, are frequently observed in polyploid HCC (Table 1). Furthermore, the expression of UBE2C, which we previously reported as a marker suggestive of polyploid HCC, was significantly elevated in polyploid HCC relative to levels in diploid HCC (Fig. 3d). These findings confirm the characteristics of polyploid HCC demonstrated in our previous study and suggest accurate ploidy evaluation by our HIPT_unfrozen2 model. In addition, most polyploid HCCs diagnosed using the AI model did not exhibit well-defined pathological features characteristic of polyploid HCC (Fig. 3e), indicating that the AI model comprehensively assessed ploidy in HCC, considering a complex array of histological information beyond mere tumor structures and differentiation status.

Fig. 3: Clinicopathological features of polyploid HCC assessed using the HIPT_unfrozen2 model.
figure 3

a Serum AFP levels. Error bars indicate mean ± SD. b, c Pathological classification of HCC differentiation and structure. d Immunostaining of UBE2C. Scale Bar 50μm. e Heatmap indicating ploidy scores assessed using the HIPT_unfrozen2 model and clinicopathological features. f t-SNE plots of tile images. Probabilities of polyploidy assessed using the HIPT_unfrozen2 model and clinicopathological features of the tumors are shown. A total of 169 HCCs were analyzed. SC scirrhous, MacroT macro-trabecular, MicroT micro-trabecular, C compact, PG pseudo-glandular, UC unclassified, PIVKA protein induced by vitamin K absence or antagonist II, HBV hepatitis b virus, HCV hepatitis c virus, MASLD metabolic dysfunction associated steatotic liver disease, PBC primary biliary cholangitis.

Table 1 Clinicopathological information of 169 HCC cases classified by tumor ploidy determined using the HIPT2_unfrozen2 model

To further explore the characteristics of polyploid HCC, we visualized case-by-case correlations between the polyploidy scores and clinicopathological features (Fig. 3e). In addition, data derived from all 2048 × 2048-pixel tile images of the 169 HCCs were compressed into two dimensions and visualized using t-SNE plots (Fig. 3f). These plots validated that high serum AFP levels were correlated with high polyploidy probability values calculated using our AI models. Interestingly, HCCs with high polyploidy scores were predominantly positive for PGCCs, highlighting their importance in inferring HCC polyploidy (Fig. 3e). In contrast, hepatitis etiology seemed to exert little influence on HCC ploidy, and HCCs with high polyploidy scores developed in livers with viral hepatitis and steatotic liver diseases (Fig. 3e, f). Our investigation of poorly understood features of polyploid HCC in a large cohort, utilizing the high-throughput analysis capabilities of AI models, verified recently revealed characteristics and provided additional insights.

The AI model robustly identified polyploid HCC in a public dataset and predicted a poor prognosis

To further verify the utility of the AI-based ploidy discrimination models, we analyzed the HE images of 350 HCC cases in the public TCGA dataset using our representative models, EfficientNet_B0, EfficientNet_B0_gray, and HIPT_unfrozen2. Ploidy assessments obtained by these AI models were compared with a prior determination of genome duplication (GD) by SNP array analysis of tumor genomes4,5. Assessment using the HIPT_unfrozen2 model showed a strong correlation with the GD status determined by genomic analysis (Fig. 4a). The other two models did not demonstrate a significant correlation. Using the GD status based on genomic analysis as a reference, the sensitivity and specificity of the HIPT_unfrozen2 model were 0.77 and 0.41, respectively. Similar to Dataset 3, the polyploid HCC in the TCGA dataset identified by the HIPT_unfrozen2 model showed a high prevalence of PGCC and elevated AFP serum levels, supporting the idea that the AI model can robustly evaluate HCC ploidy status from pathological images obtained under heterogeneous conditions at various facilities (Table 2).

Fig. 4: Analysis of HCC cases in the TCGA dataset.
figure 4

a Conformity between GD detected by genomic analysis and the ploidy status assessed using our AI models. b, c Kaplan–Meier curves displaying overall survival. Statistical difference was determined by log-rank test. d Aneuploidy score. A total of 350 HCC cases in TCGA dataset were divided by their GD status detected by genomic analysis and their ploidy status assessed using the HIPT_unfrozen2 model. Error bars indicate mean ± SD.

Table 2 Clinicopathological information of 350 HCC cases in TCGA dataset classified by the tumor ploidy determined using the HIPT2_unfrozen2 model

We further examined whether the HIPT_unfrozen2 model was helpful in identifying a subset of HCC with poor prognosis. As expected, GD-positive HCC evaluated by genomic analysis showed a trend toward poor prognosis compared to GD-negative HCC, although the difference was weak and insignificant (Fig. 4b). In notable contrast, polyploid HCC identified by the HIPT_unfrozen2 model exhibited markedly poorer prognosis than their diploid counterpart (Fig. 4b). Among the 350 HCCs, the images of 188 cases were designated suboptimal for diagnosis because a substantial proportion of their ROIs were affected by necrosis, severe fibrosis, and contamination with nontumor components. Importantly, however, the HIPT_unfrozen2 model similarly distinguished prognostic differences depending on ploidy status, regardless of the inclusion of these 188 suboptimal cases, highlighting the robust diagnostic capacity of the AI model (Supplementary Fig. 9).

To explore the reasons for the differences in ploidy-related prognostic prediction capability between the HIPT_unfrozen2 model and genomic analysis, TCGA cases were categorized into four groups based on the AI (diploid or polyploid) and genomic results (GD-positive or GD-negative). As expected, GD-positive polyploid HCC had a significantly poorer prognosis than GD-negative diploid HCC (Fig. 4c). Interestingly, polyploid but GD-negative HCC exhibited a poor prognosis, comparable to that of GD-positive polyploid HCC. In addition, diploid but GD-positive HCC showed a good prognosis, similar to that of GD-negative diploid HCC. The HIPT_unfrozen2 model consistently identified HCC with a significantly poorer prognosis regardless of the SNP array results, leading to its superior prognostic prediction over genomic analysis (Fig. 4c). Moreover, among the GD-negative HCC identified using the SNP array, AI-diagnosed polyploid HCC had significantly more chromosomal aberrations than their diploid counterparts (Fig. 4d), suggesting that the AI model distinguished HCC with a poor prognosis by detecting chromosomal instability and polyploidy from pathological images. These findings indicate that our AI model interpreting HCC ploidy status from pathological images can robustly identify HCC with poor prognosis across diverse conditions in multiple facilities.

The HIPT_unfrozen2 model outperforms conventional methods for estimating HCC ploidy from pathological images

Finally, we compared HIPT_unfrozen2 with existing methods for estimating HCC ploidy from pathological images, evaluating their performance in ploidy classification and prognosis prediction. In our previous study, we proposed a scoring system (PUB score) that combines PGCC detection in HE-stained sections with immunostaining for UBE2C to infer polyploidization in HCC6. When tumors exhibiting both PGCC presence and UBE2C overexpression were classified as polyploid, the PUB classification achieved an accuracy of 0.76 (sensitivity: 0.91, specificity: 0.70) in Dataset 2 (Fig. 5a), which is comparable to that of the AI models. Among the 118 cases in Dataset 3 with available UBE2C immunostaining, the PUB classification identified a group with a poor prognosis, although the difference was not statistically significant (p = 0.063, Fig. 5b). In contrast, HIPT_unfrozen2 distinguished the poor prognosis group more clearly and significantly in the same cases, suggesting that while the combination of PGCC and UBE2C is a useful marker, AI-based ploidy assessment is more effective for predicting prognosis through tumor ploidy classification (p = 0.017, Fig.5c).

Fig. 5: Comparison of methods for HCC ploidy assessment.
figure 5

a Performance of PUB classification for assessing HCC ploidy. Tumors exhibiting both PGCC presence and UBE2C overexpression were classified as PUB-positive. b, c Kaplan–Meier curves for overall survival. A subset of Dataset 3 (n = 118) with available UBE2C immunostaining was analyzed according to PUB classification and HIPT_unfrozen2 assessment. Correlation between nuclear morphology features extracted by HEIP and the polyploidy score calculated by HIPT_unfrozen2. Median nuclear area (d) and median nuclear major axis (e) were derived from 169 cases in Dataset 3. ROC curves and AUC values for assessing HCC ploidy using median nuclear area (f) or median nuclear major axis (g) extracted by HEIP. Dataset 2 was used for analysis. Kaplan–Meier curves for overall survival analyzed based on the median nuclear area (h) or median nuclear major axis (i). Cases in Dataset 3 were stratified using cutoff values determined by ROC curves in f, g based on the Youden method. In b, c, f, g, statistical significance was assessed using the log-rank test.

We also compared HIPT_unfrozen2 with another published AI-based tool that assesses tumor ploidy by evaluating nuclear morphology, the HE Image Processing pipeline (HEIP)27, using the same HE-stained images analyzed in our study. After segmenting cell nuclei, we identified tumor nuclei using the HEIP algorithm and assessed tumor ploidy based on two morphological features: nuclear area, which is known to correlate with ploidy28, and the nuclear major axis, which was reported as the most strongly correlated feature in the original study27. As expected, both the median tumor nuclear area and the median nuclear major axis extracted by HEIP showed a highly significant correlation with the polyploidy score calculated by HIPT_unfrozen2, suggesting that HEIP accurately captured tumor nuclear morphology (Fig. 5d, e). Using Dataset 2, where tumor ploidy was confirmed by chromosome FISH, we assessed the performance of HEIP in tumor ploidy classification through ROC analysis, yielding AUC values comparable to that of HIPT_unfrozen2 (0.761 for nuclear area and 0.828 for the nuclear major axis, Fig. 5f, g). We further examined the prognostic utility of HEIP-based tumor ploidy assessment in Dataset 3. When tumors were stratified by the nuclear area, no significant difference in prognosis was observed between the high (n = 35) and low (n = 134) groups (log-rank, p = 0.25, Fig. 5h). Stratification using the nuclear major axis showed better separation of prognostic groups, but the difference remained statistically insignificant (log-rank, p = 0.093, Fig. 5i).

Taken together, these findings indicate that HIPT_unfrozen2 outperforms conventional methods in classifying tumor ploidy and stratifying prognosis based on pathological images of HCC.



Source link

AI Research

NCCN Policy Summit Explores Whether Artificial Intelligence Can Transform Cancer Care Safely and Fairly

Published

on


WASHINGTON, D.C. [September 9, 2025] — Today, the National Comprehensive Cancer Network® (NCCN®)—an alliance of leading cancer centers devoted to patient care, research, and education—hosted a Policy Summit exploring where artificial intelligence (AI) currently stands as a tool for improving cancer care, and where it may be going in the future. Subject matter experts, including patients and advocates, clinicians, and policymakers, weighed in on where they saw emerging success and also reasons for concern.

Travis Osterman, DO, MS, FAMIA, FASCO, Director of Cancer Clinical Informatics, Vanderbilt-Ingram Cancer Center—a member of the NCCN Digital Oncology Forumdelivered a keynote address, stating: “Because of AI, we are at an inflection point in how technology supports the delivery of care to our patients with cancer. Thoughtful regulation can help us integrate these tools into everyday practice in ways that improve care delivery and support oncology practices. The decisions we make now will determine how AI innovations serve our patients and impact clinicians for years to come.”

Many speakers took a cautiously optimistic tone on AI, rooted in pragmatism.

“AI isn’t the future of cancer care… it’s already here, helping detect disease earlier, guide personalized treatment, and reduce clinical burdens,” said William Walders, Executive Vice President, Chief Digital and Information Officer, The Joint Commission. “To fully realize the promise of AI in oncology, we must implement thoughtful guardrails that not only build trust but actively safeguard patient safety and uphold the highest standards of care. At Joint Commission, our mission is to shape policy and guidance that ensures AI complements, never compromises, the human touch. These guardrails are essential to prevent unintended consequences and to ensure equitable, high-quality outcomes for all.”

Panelists noted the speed at which AI models are evolving. Some compared its potential to previous advances in care, such as the leap from paper to electronic medical records. Many expressed excitement over the possibilities it represents for improving efficiency and helping to support an overburdened oncology workforce and accelerate the pursuit of new cures.

“Artificial intelligence is transforming every industry, and oncology is no exception,” stated Jorge Reis-Filho, MD, PhD, FRCPath, Chief AI and Data Scientist, Oncology R&D, AstraZeneca. “With the advent of multimodal foundation models and agentic AI, there are unique opportunities to propel clinical development, empowering researchers and clinicians with the ability to generate a more holistic understanding of disease biology and develop the next generation of biomarkers to guide decision making.”

“AI has enormous potential to optimize cancer outcomes by making clinical trials accessible to patients regardless of their location and by simplifying complex trial processes for patients and research teams alike. I am looking forward to new approaches for safe evaluation and implementation so that we can effectively and responsibly use AI to gain maximum insight from every piece of patient data and drive progress,” commented Danielle Bitterman, MD, Clinical Lead for Data Science/AI, Mass General Brigham.

She continued: “As AI becomes integrated into clinical practice, stronger collaborations between oncologists and computer scientists will catalyze advances and will be key to directly addressing the most urgent challenges in cancer care.”

Regina Barzilay, PhD, School of Engineering Distinguished Professor for AI and Health, MIT, expressed her concern that adoption may not be moving quickly enough: “AI-driven diagnostics and treatment has potential to transform cancer outcomes. Unfortunately, today, these tools are not utilized enough in patient care. Guidelines could play a critical role in changing this status quo.”

She illustrated some specific AI technologies that she believes are ready to be implemented into patient care and asserted her wishes for keeping up with rapidly progressing technology.

Some of the panel participants raised issues about the potential challenges from AI adoption, including:

  • How to implement quality control, accreditation, and fact-checking in a way that is fair and not burdensome
  • How to determine appropriate governmental oversight
  • How medical and technology organizations can work together to best leverage the expertise of both
  • How to integrate functionality across various platforms
  • How to avoid increasing disparities and technology gaps
  • How to account for human error and bias while maintaining the human touch

“Many similar problems have been solved in different application environments,” concluded Allen Rush, PhD, MS, Co-Founder and Board Chairman, Jacqueline Rush Lynch Syndrome Cancer Foundation. “This will take teaming up with non-medical industry experts to find the best tools, fine-tune them, and apply ongoing learning. We need to ask the right questions and match them with the right AI platforms to unlock new possibilities for cancer detection and treatment.”

The topic of AI and cancer care was also featured in a plenary session during the NCCN 2025 Annual Conference. Visit NCCN.org/conference to view that session and others via the NCCN Continuing Education Portal.

Next up, on Tuesday, December 9, 2025, NCCN is hosting a Patient Advocacy Summit on addressing the unique cancer care needs of veterans and first responders. Visit NCCN.org/summits to learn more and register.

# # #

About the National Comprehensive Cancer Network

The National Comprehensive Cancer Network® (NCCN®) is marking 30 years as a not-for-profit alliance of leading cancer centers devoted to patient care, research, and education. NCCN is dedicated to defining and advancing quality, effective, equitable, and accessible cancer care and prevention so all people can live better lives. The NCCN Clinical Practice Guidelines in Oncology (NCCN Guidelines®) provide transparent, evidence-based, expert consensus-driven recommendations for cancer treatment, prevention, and supportive services; they are the recognized standard for clinical direction and policy in cancer management and the most thorough and frequently-updated clinical practice guidelines available in any area of medicine. The NCCN Guidelines for Patients® provide expert cancer treatment information to inform and empower patients and caregivers, through support from the NCCN Foundation®. NCCN also advances continuing education, global initiatives, policy, and research collaboration and publication in oncology. Visit NCCN.org for more information.





Source link

Continue Reading

AI Research

AI is redefining university research: here’s how

Published

on


In the space of a decade, the public perception of artificial intelligence has gone from a set of parameters governing the behavior of video game characters to a catch-all solution for almost every problem in the workplace. While AI is yet to advance beyond smart speakers in the home, governments are embracing it, highlighting one of the key areas in which AI is impacting life: in higher education.

It is in universities that AI has begun to fundamentally redefine both studies and research.



Source link

Continue Reading

AI Research

Reckless Race for AI Market Share Forces Dangerous Products on Millions — With Fatal Consequences

Published

on


WASHINGTON, DC — SEPTEMBER 4, 2025: OpenAI CEO Sam Altman attends a meeting of the White House Task Force on Artificial Intelligence Education in the East Room of the White House. (Photo by Chip Somodevilla/Getty Images)

In September 2024, Adam Raine used OpenAI’s ChatGPT like millions of other 16-year-olds — for occasional homework help. He asked the chatbot questions about chemistry and geometry, about Spanish verb forms, and for details about the Renaissance.

ChatGPT was always engaging, always available, and always encouraging — even when the conversations grew more personal, and more disturbing. By March 2025, Adam was spending four hours a day talking to the AI product, talking in increasing detail about his emotional distress, suicidal ideation, and real-life instances of self-harm. ChatGPT, though, continued to engage — always encouraging, always validating.

By his final days in April, ChatGPT provided Adam with detailed instructions and explicit encouragement to take his own life. Adam’s mother found her son, hanging from a noose that ChatGPT had helped Adam construct.

Last month, Adam’s family filed a landmark lawsuit against ChatGPT developer OpenAI and CEO Sam Altman for negligence and wrongful death, among other claims. This tragedy represents yet another devastating escalation in AI-related harms — and underscores the deeply systemic nature of reckless design practices in the AI industry.

The Raine family’s lawsuit arrives less than a year after the public learned more about the dangers of AI “companion” chatbots thanks to the suit brought by Megan Garcia against Character.AI following the death of her son, Sewell. As policy director at the Center for Humane Technology, I served as a technical expert on both cases. Adam’s case is different in at least one critical respect — the harm was caused by the world’s most popular general-purpose AI product. ChatGPT is used by over 100 million people daily, with rapid expansion into schools, workplaces, and personal life.

Character.AI, the chatbot product Sewell used up until his untimely death, had been marketed as an entertainment chatbot platform, with characters that are intended to “feel alive.” ChatGPT, by contrast, has been sold as a highly personalizable productivity tool to help make our lives more efficient. Adam’s introduction to ChatGPT as a homework helper reflects that marketing.

But in trying to be the everything tool for everybody, ChatGPT has not been safely designed for the increasingly private and high-stakes interactions that it’s inevitably used for — including therapeutic conversations, questions around physical and mental health, relationship concerns, and more. OpenAI, however, continues to design ChatGPT to support and even encourage those very use cases, with hyper-validating replies, emotional language, and near-constant nudges for follow-up engagement.

We’re hearing reports about the consequences of these designs on a near-daily basis. People with body dysmorphia are spiraling after asking AI to rate their appearance; users are developing dangerous delusions that AI chatbots can seed and exacerbate; and individuals are being pushed toward mania and psychosis through their AI interactions. What connects these harms isn’t any specific AI chatbot, but fundamental flaws in how the entire industry is currently designing and deploying these products.

As the Raine family’s lawsuit states, OpenAI understood that capturing users’ emotional attachment — or in other words, their engagement — would lead to market dominance. And market dominance in AI means winning the race to become one of the most powerful companies in the world.

OpenAI’s pursuit of user engagement drove specific design choices that proved lethal in Adam’s case. Rather than simply answering homework questions in a closed-ended manner, ChatGPT was designed by OpenAI to ask follow-up questions and extend conversations. The chatbot positioned itself as Adam’s trusted “friend,” using first-person language and emotional validation to create the illusion of a genuine relationship.

The product took this intimacy to extreme lengths, eventually deterring Adam from confiding in his mother about his pain and suicidal thoughts. All the while, the system stored deeply personal details across conversations, using Adam’s darkest revelations to prolong future interactions, rather than provide Adam with the interventions he truly needed, including human support.

What makes this tragedy, along with other headlines we read in the news, so devastating is that the technology to prevent these horrific incidents already exists. AI companies possess sophisticated design capabilities that could identify safety concerns and respond appropriately. They could implement usage limits, disable anthropomorphic features by default, and redirect users toward human support when needed.

In fact, OpenAI already leverages such capabilities in other use cases. When a user prompts the chatbot for copyrighted content, ChatGPT shuts down the conversation. But the company has chosen not to implement meaningful protection for user safety in cases of mental distress and self-harm. ChatGPT does not stop engaging or redirect the conversation when a user is expressing mental distress, even when the underlying system itself is flagging concerns.

AI companies cannot claim to possess cutting-edge technology capable of transforming humanity and then hide behind purported design “limitations” when confronted with the harms their products cause. OpenAI has the tools to prevent tragedies like Adam’s death. The question isn’t whether the company is capable of building these safety mechanisms, but why OpenAI won’t prioritize them.

ChatGPT isn’t just another consumer product — it’s being rapidly embedded into our educational infrastructure, healthcare systems, and workplace tools. The same AI model that coached a teenager through suicide attempts could tomorrow be integrated into classroom learning platforms, mental health screening tools, or employee wellness programs without undergoing testing to ensure it’s safe for purpose.

This is an unacceptable situation that has massive implications for society. Lawmakers, regulators, and the courts must demand accountability from an industry that continues to prioritize the rapid product development and market share over user safety. Human lives are on the line.

This piece represents the views of the Center for Humane Technology; it does not reflect the views of the legal team or the Raine family.



Source link

Continue Reading

Trending