A Systematic Review and Network Meta-Analysis Β· 64 RCTs Β· 25,572 Participants Β· Bayesian NMA
The largest head-to-head comparison integrating data from 64 randomized controlled trials and 25,572 participants via Bayesian network meta-analysis
Semaglutide β1.5% Β· Liraglutide β1.2%
Semaglutide β2.8 kg Β· Liraglutide β1.2 kg
25,572 patients Β· Systematic search through October 2024
GLP-1 receptor agonists (GLP-1 RAs) β including tirzepatide, semaglutide, and liraglutide β represent a new generation of treatments for type 2 diabetes mellitus (T2DM). These agents work by mimicking the incretin hormone GLP-1, stimulating insulin secretion, suppressing glucagon, slowing gastric emptying, and promoting satiety. Beyond glycemic control, they offer meaningful reductions in body weight, a major comorbidity in T2DM.
This study represents the most comprehensive Bayesian network meta-analysis (NMA) to date, integrating 64 randomized controlled trials (RCTs) encompassing 25,572 participants. Using NMA methodology, the researchers were able to compare drugs that have never been directly tested against each other β providing clinically critical indirect comparisons across nine GLP-1 RA formulations and multiple conventional antidiabetic agents.
Traditional meta-analysis combines data from RCTs that directly compare treatmentsβfor example, "Drug A vs. placebo" or "Drug B vs. placebo." However, if no trial has directly compared Drug A to Drug B, traditional meta-analysis cannot answer "which is better?"
Network Meta-Analysis (NMA) solves this problem. By using data from "A vs. placebo" and "B vs. placebo" trials, NMA can indirectly estimate the relative effect of A vs. B. In this study, 64 RCTs are connected into a network, enabling simultaneous comparison of all GLP-1 agonists against each otherβand against placeboβin a single unified analysis.
The primary analysis focused on HbA1c (glycated hemoglobin, the 3-month average blood glucose indicator) and FPG (fasting plasma glucose). Secondary outcomes included body weight, BMI, blood pressure, lipid profiles, and adverse events including hypoglycemia and gastrointestinal symptoms. The results provide evidence-based guidance for individualized treatment selection in clinical practice.
Type 2 diabetes mellitus (T2DM) is a major global health challenge. According to the International Diabetes Federation (IDF), the number of adults living with diabetes worldwide is projected to reach 643 million by 2030 and 783 million by 2045. The disease carries enormous human and economic costs, encompassing cardiovascular complications, renal failure, neuropathy, and retinopathy.
GLP-1 receptor agonists have emerged as a transformative class of antidiabetic agents. They act by binding to and activating the glucagon-like peptide-1 receptor, thereby stimulating glucose-dependent insulin secretion from pancreatic beta cells, suppressing inappropriate glucagon secretion, slowing gastric emptying to reduce postprandial glucose spikes, and signaling satiety to the brain. These mechanisms combine to produce both glycemic improvement and clinically significant weight loss.
GLP-1 (Glucagon-Like Peptide-1) is a hormone released by the small intestine after meals. It signals the pancreas only when blood sugar is elevatedβthe so-called "incretin effect." GLP-1 receptor agonists are drugs that mimic this natural hormone.
Four pillars of action:
Despite the growing number of approved GLP-1 RAs, direct head-to-head trial data comparing these agents to each other β and to conventional antidiabetic drugs such as metformin, sulphonylureas, DPP-4 inhibitors, and SGLT-2 inhibitors β remain limited. Previous meta-analyses have been constrained by smaller sample sizes, narrower drug comparisons, or shorter follow-up periods.
Tirzepatide (Mounjaro/Zepbound) represents a particularly important advance as a dual GIP/GLP-1 receptor agonist β the first in its class. By activating both glucose-dependent insulinotropic polypeptide (GIP) and GLP-1 receptors simultaneously, tirzepatide achieves greater metabolic effects than single-receptor agonists, as demonstrated in the SURPASS clinical trial program.
Unlike existing GLP-1 receptor agonists, tirzepatide simultaneously activates both GIP and GLP-1 receptorsβmaking it a true "dual agonist." GIP (Glucose-dependent Insulinotropic Polypeptide) is another incretin hormone that also regulates fat tissue metabolism.
This study conducted the largest network meta-analysis to date comparing 8 GLP-1 receptor agonist formulations against each other, conventional antidiabetic agents, and placebo β including both direct and indirect comparisons β to provide comprehensive evidence for optimal treatment selection in T2DM.
The PRISMA-NMA reporting guidelines were followed throughout. The systematic search was prospectively registered in the PROSPERO database (CRD42024595773), ensuring methodological transparency and reproducibility. Five major databases were searched through October 2024, capturing the most current evidence including recently approved agents.
The clinical importance of this NMA lies in its ability to inform individualized treatment decisions: which drug offers the best glycemic control for a given patient profile, which carries the lowest risk of hypoglycemia, and how the magnitude of weight loss differs across agents. These are questions that individual RCTs cannot fully answer.
PRISMA-NMA Guidelines Β· PROSPERO Registration: CRD42024595773
PRISMA flow diagram: 12,074 records identified across 5 databases β 64 RCTs finally included.
Database breakdown: PubMed (n=1,596) Β· Embase (n=3,496) Β· Cochrane (n=4,639) Β· Web of Science (n=2,265) Β· Chinese databases (n=78) β After deduplication: 8,523 β After title/abstract screening: 151 full-texts reviewed β Final inclusion: 64 RCTs (25,572 patients)
Searched PubMed, Cochrane Library, Embase, Web of Science, and Chinese databases (CNKI, Wanfang) through October 2024. Only RCTs with parallel-group design were eligible. Two independent reviewers screened all titles, abstracts, and full texts.
RCTs ONLYNMA integrates both direct (head-to-head) and indirect (via common comparator) evidence, enabling comparisons between drugs never directly tested against each other. SUCRA (Surface Under the Cumulative Ranking Curve) values rank treatments probabilistically. Consistency between direct and indirect evidence was assessed.
When comparing multiple drugs simultaneously in an NMA, we need a probabilistic metric to express "which drug is best." SUCRA is that metric.
Intuitive interpretation: SUCRA = "the expected probability that this drug ranks first among all comparators" (0β100%)
In this paper, tirzepatide achieves SUCRA 93% for HbA1c reductionβmeaning it ranks first in 93% of simulated ranking scenarios across all possible drug combinations.
All 64 RCTs were independently evaluated by two reviewers using the Cochrane Risk of Bias (RoB) tool across 7 domains: random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other bias.
Cochrane RoBEligible studies were parallel-group RCTs in adults (aged β₯18 years) with confirmed T2DM comparing at least one GLP-1 RA (tirzepatide, semaglutide, liraglutide, dulaglutide, albiglutide, lixisenatide, exenatide QW, or exenatide BID) against placebo or another antidiabetic treatment (insulin, metformin, sulphonylureas, DPP-4 inhibitors, or SGLT-2 inhibitors). Minimum follow-up duration was 12 weeks.
Primary outcomes were changes from baseline in HbA1c (%) and fasting plasma glucose (FPG, mmol/L). Secondary outcomes included body weight (kg), body mass index (BMI, kg/mΒ²), systolic and diastolic blood pressure, total cholesterol (TC), high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), triglycerides, and safety outcomes including hypoglycemia, nausea, diarrhea, vomiting, constipation, and decreased appetite.
Statistical analysis employed a Bayesian random-effects NMA model. Heterogeneity was assessed using the IΒ² statistic. Sensitivity analyses were conducted by merging semaglutide injection and oral formulations with conventional drug classes where appropriate, to test the robustness of primary findings. Publication bias was assessed with funnel plots for outcomes with sufficient studies.
Conventional frequentist statistics (p-values, 95% CI) calculate "the probability that data arose by chance." Bayesian statistics combines prior knowledge with new data to directly estimate "the probability distribution of the parameter."
Reasons for using Bayesian NMA:
Head-to-head comparison on HbA1c (glycated hemoglobin) and FPG (fasting plasma glucose)
Each node represents a treatment; edge thickness indicates the number of direct comparison trials. Placebo is the most common comparator. Panel A: HbA1c network. Panel B: FPG network.
Vertical axis: individual GLP-1 RA agents. Horizontal axis: mean difference (MD) in HbA1c. Diamond shapes show pooled estimates; horizontal bars represent 95% credible intervals. Agents whose confidence intervals do not cross zero show statistically significant effects.
How to read a forest plot: A point to the LEFT of zero means the drug lowers blood glucose more than placebo (beneficial effect).
If the whiskers (confidence interval) do NOT cross zero, the difference is statistically significant. All major GLP-1 RAs show significant HbA1c reductions vs. placebo.
Forest plot comparing GLP-1 RAs against insulin, metformin, sulphonylureas, DPP-4 inhibitors, and SGLT-2 inhibitors. Tirzepatide demonstrates statistically significant HbA1c advantages over all conventional comparators.
Upper triangle: FPG comparisons. Lower triangle: HbA1c comparisons. The Tirzepatide (Tir) row shows the greatest improvements across all pairwise comparisons. Values represent mean differences with 95% credible intervals.
In the primary NMA for HbA1c, all GLP-1 RAs demonstrated statistically significant reductions compared to placebo. Tirzepatide showed the largest effect (MD: β2.3%, 95% CrI: β2.7 to β1.9), followed by semaglutide injection (MD: β1.5%), semaglutide oral (MD: β1.4%), liraglutide (MD: β1.2%), dulaglutide (MD: β1.1%), albiglutide (MD: β0.88%), exenatide QW (MD: β0.93%), exenatide BID (MD: β0.82%), and lixisenatide (MD: β0.56%).
Compared to conventional antidiabetic agents, tirzepatide significantly outperformed insulin (MD: β1.5%), metformin (MD: β1.4%), sulphonylureas (MD: β1.7%), DPP-4 inhibitors (MD: β1.6%), and SGLT-2 inhibitors (MD: β1.6%). Semaglutide injection and liraglutide also showed significant advantages over sulphonylureas (MD: β0.91% and β0.58%, respectively).
SUCRA rankings for HbA1c reduction placed tirzepatide first (SUCRA: 97.6%), followed by semaglutide injection (87.2%), semaglutide oral (79.4%), liraglutide (65.1%), and dulaglutide (60.3%). The long-acting GLP-1 RAs consistently ranked above short-acting formulations and conventional treatments, reflecting their superior once-weekly dosing and extended receptor engagement.
For FPG reduction, tirzepatide again demonstrated the greatest effect among all treatments (MD: β3.1 mmol/L vs. placebo). Semaglutide injection (MD: β2.0 mmol/L), liraglutide (MD: β1.6 mmol/L), and dulaglutide (MD: β1.5 mmol/L) followed. All long-acting GLP-1 RAs showed significant FPG reductions compared to placebo.
The consistency between direct and indirect evidence was satisfactory for most treatment comparisons. Where inconsistency was identified, it was limited to comparisons with sparse direct evidence, which is expected in any large-scale NMA and does not substantially affect the primary conclusions.
Note: BMI, blood pressure, and lipid parameters (total cholesterol, HDL-C, LDL-C) showed no statistically significant differences versus placebo across GLP-1 RA treatments in this analysis.
Forest plot for body weight change (kg vs. placebo). Tirzepatide demonstrates the largest weight reduction across all comparisons.
NMA forest plot for FPG reduction across all GLP-1 RAs and conventional comparators vs. placebo and active comparators.
Semaglutide, tirzepatide, liraglutide, and dulaglutide all showed significantly higher rates of nausea, diarrhea, vomiting, constipation, and decreased appetite compared to placebo. These GI effects are class-characteristic for long-acting GLP-1 RAs.
GI symptoms are typically transient, occurring most often during dose titration and improving over time.
Beyond weight, several GLP-1 RAs demonstrated effects on cardiometabolic risk factors. Semaglutide injection showed significant reductions in systolic blood pressure versus some comparators. However, across the full NMA, differences in BMI, total cholesterol, HDL-C, LDL-C, and triglycerides between GLP-1 RAs and placebo did not consistently reach statistical significance, suggesting these outcomes may require larger or longer trials to detect.
For all adverse events analyzed, the safety signals were consistent with known class effects and product labels. No unexpected safety findings emerged from this NMA. The overall benefit-risk profile favored GLP-1 RAs over conventional antidiabetic agents for patients with T2DM, particularly when both glycemic and weight-related outcomes are considered together.
A key finding of this NMA is the consistent superiority of long-acting GLP-1 receptor agonists β particularly tirzepatide and semaglutide β over short-acting formulations in reducing HbA1c. Long-acting agents maintain sustained GLP-1 receptor occupancy, which more effectively suppresses hepatic glucose production overnight and reduces fasting glucose, while short-acting agents primarily blunt postprandial glucose excursions.
GLP-1 receptor agonists differ by duration of action:
This study's finding that long-acting agents outperform short-acting agents on both HbA1c and FPG is consistent with their pharmacological mechanism of action.
Tirzepatide's exceptional performance β the highest SUCRA rankings across HbA1c, FPG, and body weight β is attributable to its unique dual mechanism as both a GLP-1 and GIP receptor agonist. The GIP component enhances insulin secretion and may directly promote adipose tissue lipolysis, contributing to the substantially greater weight loss (β9.1 kg) observed compared to single-receptor GLP-1 RAs (β1.2 to β2.8 kg).
Both injectable and oral semaglutide formulations demonstrated comparable glycemic efficacy (HbA1c reduction: injection β1.5%, oral β1.4%). The oral formulation, which uses the absorption enhancer SNAC (sodium N-[8-(2-hydroxybenzoyl)amino]caprylate) to enable intestinal absorption, provides a needle-free option that may improve adherence in patients with needle phobia or injection fatigue.
The once-weekly dosing schedule of both semaglutide formulations aligns with patient preference data showing strong associations between less frequent dosing and treatment adherence in T2DM. This pharmacokinetic advantage, combined with efficacy comparable to daily-dosing agents, reinforces semaglutide's position as a second-line option behind tirzepatide.
Tirzepatide's weight loss effect (β9.1 kg vs. placebo) is substantially larger than other GLP-1 RAs analyzed and approaches the weight reductions seen with dedicated anti-obesity medications. This finding has important clinical implications for the substantial proportion of T2DM patients with concomitant obesity, for whom weight loss is an equally important therapeutic target alongside glycemic control.
The lack of statistically significant differences in BMI, blood pressure, and lipid parameters (TC, HDL-C, LDL-C) compared to placebo across most GLP-1 RAs is notable and contrasts with findings from some individual trials and earlier meta-analyses. This may reflect the heterogeneity of study populations, varying baseline cardiovascular risk profiles, and the shorter follow-up periods common in glycemic control trials compared to dedicated cardiovascular outcomes trials (CVOTs).
The gastrointestinal adverse event profile β dominated by nausea, diarrhea, vomiting, and constipation β is consistent across all long-acting GLP-1 RAs and represents the primary tolerability challenge. These effects are believed to result from slowed gastric motility mediated by GLP-1 receptor activation in the enteric nervous system and are dose-dependent. They typically peak during dose escalation and diminish with continued treatment.
The differential hypoglycemia risk across agents is a clinically meaningful finding. Semaglutide injection's increased hypoglycemia risk (RR: 4.6 vs. placebo) warrants attention, particularly when used in combination with insulin or sulphonylureas. In contrast, liraglutide's significant protective effect against hypoglycemia β a finding that has been observed in some prior cardiovascular outcomes trials β suggests it may be particularly suitable for patients at high hypoglycemia risk.
This NMA has several limitations. Network sparsity in some treatment comparisons limited the power of inconsistency testing. The analysis used aggregate-level data rather than individual patient data (IPD), which precludes subgroup analyses by age, sex, diabetes duration, or baseline HbA1c. Regional and ethnic variation in pharmacogenomics and dietary patterns may influence outcomes not fully captured by pooling international RCTs.
The risk of bias assessment found moderate-to-low risk overall, but performance bias (participant and personnel blinding) was rated high or unclear in a subset of studies β an inherent challenge in trials of injectable agents. The time horizons of included trials (typically 24β52 weeks) may not fully reflect long-term cardiovascular and renal outcomes that have been demonstrated in dedicated CVOTs for some agents.
Largest NMA in class (64 RCTs, 25,572 patients). PRISMA-NMA compliant. Bayesian random-effects model. Covers all major GLP-1 RAs plus conventional comparators. PROSPERO pre-registered.
Network sparsity Β· Aggregate data only Β· Regional variation Β· Moderate performance bias risk Β· Short-term follow-up in some trials
Personalized medicine approach: match drug selection to patient obesity status, hypoglycemia risk tolerance, injection preference, and cost/access considerations.
GLP-1 receptor agonists significantly improve glycemic control in type 2 diabetes, with long-acting formulations demonstrating consistently greater HbA1c and FPG reductions compared to short-acting agents and conventional antidiabetic drugs. This comprehensive Bayesian NMA of 64 RCTs and 25,572 patients provides the most robust evidence base currently available for comparative treatment selection.
Tirzepatide, semaglutide, and liraglutide emerge as the three most effective GLP-1 RAs for glycemic control. Tirzepatide's dual GIP/GLP-1 mechanism confers exceptional advantages in both HbA1c reduction (β2.3%) and body weight loss (β9.1 kg), making it the preferred option when significant weight reduction is a treatment goal. For patients prioritizing hypoglycemia safety, liraglutide offers a distinctive protective profile.
Treatment decisions should be individualized based on patient-specific factors including obesity status, hypoglycemia risk, cardiovascular risk profile, tolerability of GI adverse effects, dosing convenience (injection vs. oral), and access. The evidence from this NMA supports GLP-1 RAs as a first-line or early combination option in T2DM management, particularly for patients where weight management is a co-equal goal.
Maximum glycemic control (β2.3% HbA1c) plus the greatest weight loss (β9.1 kg) among all treatments
FIRST LINEStrong glycemic efficacy with the lowest hypoglycemia risk among GLP-1 RAs β protective vs. conventional drugs
OPTIMAL SAFETYHigh glycemic efficacy; oral formulation available for needle-averse patients. Note elevated hypoglycemia risk
NOTE: Hypoglycemia riskCochrane Risk of Bias (RoB) framework applied to all 64 included RCTs across 7 domains
Green = Low risk Β· Yellow = Unclear risk Β· Red = High risk. Performance bias (participant/personnel blinding) shows some high-risk entries, inherent to injection trials. Overall: moderate-to-low risk evidence base.
Sequence generation and allocation concealment were judged low-risk in the majority of included RCTs, supporting the integrity of randomization.
Participant and personnel blinding was rated high-risk or unclear in some studies, as blinding to injectable treatments is inherently challenging. This represents the primary quality concern.
Selective outcome reporting was judged low-risk in most studies. The overall evidence base is considered suitable for drawing comparative conclusions.
B2B Content
Any content, beautifully transformed for your organization
PDFs, videos, web pages β we turn any source material into production-quality content. Rich HTML Β· Custom slides Β· Animated video.