Say what? Collaborative pop lyric generation using multitask transfer learning

Naveen Ram, Tanay Gummadi, Rahul Bhethanabotla, Richard J. Savery, Gil Weinberg

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

1 Citation (Scopus)

Abstract

Lyric generation is a popular sub-field of natural language generation that has seen growth in recent years. Pop lyrics are of unique interest due to the genre's unique style and content, in addition to the high level of collaboration that goes on behind the scenes in the professional pop songwriting process. In this paper, we present a collaborative line-level lyric generation system that utilizes transfer-learning via the T5 transformer model, which, till date, has not been used to generate pop lyrics. By working and communicating directly with professional songwriters, we develop a model that is able to learn lyrical and stylistic tasks like rhyming, matching line beat requirements, and ending lines with specific target words. Our approach compares favorably to existing methods for multiple datasets and yields positive results from our online studies and interviews with industry songwriters.
Original languageEnglish
Title of host publicationProceedings of the 9th International Conference on Human-Agent Interaction (HAI '21)
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages165-173
Number of pages9
ISBN (Electronic)9781450386203
DOIs
Publication statusPublished - Nov 2021
Externally publishedYes
EventInternational Conference on Human-Agent Interaction (9th : 2021) - , Japan
Duration: 9 Nov 202111 Nov 2021

Conference

ConferenceInternational Conference on Human-Agent Interaction (9th : 2021)
Abbreviated titleHAI '21
Country/TerritoryJapan
Period9/11/2111/11/21

Keywords

  • collaborative AI
  • lyric generation
  • natural language generation
  • pop music
  • transfer learning
  • transformers

Fingerprint

Dive into the research topics of 'Say what? Collaborative pop lyric generation using multitask transfer learning'. Together they form a unique fingerprint.

Cite this