Montreal AI Ethics Institute

Democratizing AI ethics literacy


The Ethical Need for Watermarks in Machine-Generated Language

November 27, 2022

🔬 Research summary by Connor Wright, our Partnerships Manager.

[Original paper by A. Grinbaum and L. Adomaitis]


Overview: As large language models such as Meta’s Galactica and GPT-3 become more capable of producing fluent text, it becomes increasingly difficult to distinguish machine-generated from human-generated writing. Consequently, the authors propose watermarking techniques to keep the two apart and avoid the grave dangers of manipulation.


Introduction

Large language models are treated as one of the most promising research areas in AI; AI companies in healthcare, for instance, have increased the portion of their budgets dedicated to these models. As these models improve, it is becoming increasingly difficult to distinguish machine-generated from human-generated text. In response, the authors propose focusing on watermarking methods to preserve this distinction, framing the issue as the problem of “indistinguishability.”

Key Insights

The problem of “indistinguishability”

Since the Turing test, “[i]ndistinguishability” (p. 2) has become the benchmark for positive AI performance. However, current regulation does not clearly distinguish between human-generated and machine-generated text, even though the latter does not involve the critical thinking and reflection of the former. AI can indeed produce text, but it cannot reflect on the meaning of the text it produces. It may be able to generate scientific papers, as language models such as Galactica do, but it cannot recognize when it is producing fake news. Without a clear distinction between the two, it becomes difficult to impose sanctions on instances of manipulation.

Manipulation

Tying into the debate surrounding truth and deepfakes, AI-generated pieces are less trustworthy and, according to the authors, cannot be held to account in the same way as human-generated pieces. Given the opportunity to manipulate at scale, AI can use emotionally charged language to influence the user. Within the paper, this is demonstrated in the conversation between Joshua and Jessica, in which the language model impersonating Jessica uses specific terminology to persuade Joshua that Jessica lives on despite having passed away. Stories like these not only undermine our ability to detect what is true but also tarnish a positive view of technology. Hence, distinguishability is crucial both for preserving truth and for maintaining a beneficial view of technology.

To preserve these two perspectives, the authors propose the watermarking technique.

Watermarking

Watermarking techniques look to provide clear signs of who authored the piece of writing in question. These techniques include using a hash function to generate a specific bit sequence (a row of 1s and 0s), alongside steganographic approaches (hiding a secret message within a normal body of text).
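To make the hash-based idea concrete, the minimal sketch below derives a bit sequence from a piece of text and a secret key. The function name `watermark_bits`, the choice of SHA-256, and the 32-bit length are illustrative assumptions; the paper does not prescribe a specific scheme.

```python
import hashlib

def watermark_bits(text: str, secret_key: str, n_bits: int = 32) -> str:
    """Derive a bit sequence (a row of 1s and 0s) from text plus a secret key.

    A verifier holding the same key can recompute the sequence and check it
    against one distributed alongside the text. Illustrative sketch only.
    """
    digest = hashlib.sha256((secret_key + text).encode("utf-8")).digest()
    bits = "".join(f"{byte:08b}" for byte in digest)
    return bits[:n_bits]

# Example: a provider could publish this sequence next to the generated text.
print(watermark_bits("This passage was produced by a language model.", "provider-key"))
```

A sequence like this travels alongside the text; steganographic approaches instead hide the mark within the text itself, which is the direction the authors’ own proposal takes.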

The authors propose equidistant language sequencing as a non-intrusive watermarking method. This is where a word/letter is repeated at regular intervals in the text without interrupting the user’s reading experience. For example, the language model may repeat the letter ‘a’ every 70 characters to signal that it is machine-generated. This may not be visible to the human eye immediately, but it will serve interested parties well when required to distinguish the machine-generated and human-generated text.
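As a rough illustration of how an interested party might check for such a mark, the sketch below tests whether a chosen character recurs at a fixed interval. The function name `has_equidistant_mark`, the target character ‘a’, and the 70-character interval follow the example above but are otherwise assumptions; the paper describes the idea only informally.

```python
def has_equidistant_mark(text: str, char: str = "a", interval: int = 70) -> bool:
    """Return True if `char` appears at every `interval`-th character of `text`.

    Mirrors the example of repeating the letter 'a' every 70 characters;
    a hypothetical verifier would scan for exactly this regularity.
    """
    positions = range(interval - 1, len(text), interval)
    return len(text) >= interval and all(text[i] == char for i in positions)

# Build a toy string that carries the mark, then verify it.
marked = ("x" * 69 + "a") * 3
assert has_equidistant_mark(marked)
assert not has_equidistant_mark("plain human-written text with no such pattern")
```

In practice, a generator would have to place the mark while preserving fluency, which is why the authors stress that the pattern should not interrupt the reading experience.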

Between the lines

The watermarking technique, while in its infancy, is important to consider. Even when we know we are talking to a machine, we still tend to project mental states onto the AI and anthropomorphize the technology. Hence, a discernible sign to help prevent emotional manipulation by machines will prove a crucial step among the ethical safeguards proposed for these technologies.

However, the type of watermark proposed may need to be clearer. While we don’t want to muddle the dataset by adjusting the data to contain a visible label, requiring someone to decipher a hidden watermark may prove an avoidable step. Instead, labeling the piece as ‘machine-generated’, as in the self-disclosure requirement of the California chatbot law, could prove a helpful step in the fight against manipulation.

Want quick summaries of the latest research & reporting in AI ethics delivered to your inbox? Subscribe to the AI Ethics Brief. We publish bi-weekly.
