User preferences for large language model refusals: implications for moderation and market structure

Kireyev, Pavel ORCID: 0009-0004-4776-753X and Vitorino, Maria Ana (2025) User preferences for large language model refusals: implications for moderation and market structure. . London School of Economics and Political Science, London, UK. (Unpublished)

Full text not available from this repository.

Abstract

Large language models (LLMs) differ in their moderation and content policies, which determine which prompts these models refuse to answer. These refusals can affect user decisions of which models to use and whether to make safe or risky prompts. Using data from LMArena, where users select preferred responses to their prompts from paired LLM comparisons, we estimate a discrete choice model that captures user preferences for making risky prompts and their choice of which LLM provides the best response quality given the possibility of refusals. We leverage this model to analyze how moderation policies affect market shares across proprietary and opensource LLMs. Our findings reveal that proprietary LLMs provide higher quality responses and maintain larger market shares, but implement stricter moderation policies with higher refusal rates compared to open-source alternatives. This stricter moderation by proprietary LLMs reduces market concentration by allowing lower-quality open-source LLMs to compete effectively in the risky prompt segment. Mandating uniform moderation policies across all LLMs could increase market concentration favoring proprietary LLMs, potentially hampering competition. Our framework characterizes the efficient frontier of moderation policies that balance market concentration and safety

Item Type:	Monograph (Working Paper)
Additional Information:	© 2025 The Author(s)
Divisions:	Management
Subjects:	Q Science > Q Science (General) H Social Sciences > HD Industries. Land use. Labor H Social Sciences > HF Commerce
Date Deposited:	12 Dec 2025 12:39
Last Modified:	12 Dec 2025 16:21
URI:	http://eprints.lse.ac.uk/id/eprint/130606

Actions (login required)

View Item