Transcript Workflow

From The Portal Wiki
Revision as of 20:21, 19 April 2020 by BeefSandwich27 (talk | contribs)
Jump to navigation Jump to search

This is where we coordinate to get transcripts of every episode completed. If you want to help, join The Portal Transcripts Discord. This is part of the Transcript Completion Project.

Getting Started

Add Podcast Metadata to Wiki

  • Create a page for the podcast episode
  • Add the thumbnail using the format of this URL. Example: maxresdefault.jpg
  • Buttons: next, prev, mp3, art19

Join the Transcripts Discord

Pick an Episode that You'd Like to Work On

See What Status It's In

Working on a Transcript

Make a Machine Readable Version

  • Get the MP3 (from the episode wiki page)
  • Import the MP3 into Descript
  • Export a VTT file from Descript
    • Check the "Remove ums/uhs Checkbox"
  • Upload VTT file to wiki
  • Link VTT file in The Portal Media Progress

Run Through Alex's VTT Parsing Program

  • Get Alex to explain how the program works
  • Export MediaWiki-friendly text
  • Paste into Wiki

Make Human-Made Version

  • Add your name to the "Responsibility" column in The Portal Media Progress, so people know who to contact with question.
  • Make edits to the wiki. All progress will be tracked in the "View History" tab.
  • Once you feel like you're done, change the Transcript Status to "Human-Made" in The Portal Media Progress.

Update VTT file with Human-Made Version

Run Through VTT Parser to Make Wiki-Friendly Text

  • Paste new version to wiki.
  • Update "Transcript Notes".

Making a Transcript (Old. Delete me.)

The programs below have been useful for automatically generating a transcription of audio or video, which then only needs editing to properly identify speakers, fix grammatical issues, and be made human-readable. Once this has been done transcripts can be exported as Microsoft Word documents, which allows for easy find-and-replace editing that implements wiki-formatting syntax before copying the finalized transcript onto its respective wiki page. In addition, exporting and uploading to the wiki a .vtt version of the transcript allows it to be used in many of our community projects. Some of the listed programs have free trials/transcription time, so don't hesitate to get involved!

Each time a new YouTube video is released, we need to take the audio from that and upload it to the CDN on the wiki. This will ensure that all of our time codes are matching up.

Annotating a Transcript

Progress

The Portal Podcast Human Readable Transcripts

# Title Transcript Status Wiki Page Status
31 31: to be broadcast No transcript. Needs transcript with linking and subcategories.
30 30: Ross Douthat - The Rave before the Fall No transcript. Needs transcript with linking and subcategories.
29 29: Jamie Metzl - The Bio-Hacker will see you now, Ready or Not No transcript. Needs transcript with linking and subcategories.
Special 1 A Portal Special Presentation- Geometric Unity: A First Look Transcript needs editing. File:Geometric-Unity-A-First-Look - YouTube.vtt
28 28: Eric Lewis - The Singular Genius of Elew Transcript incomplete and need machine-readable transcript. Needs transcript with linking and subcategories.
27 27: Daniel Schmachtenberger - On Avoiding Apocalypses Transcript incomplete and need machine-readable transcript. Needs transcript with linking and subcategories.
26 26: James O’Keefe: What is (and isn't) Journalism in the 21st century Machine-readable transcript complete, need human-readable transcript. Needs transcript with linking and subcategories.
25 25: The Construct: Jeffrey Epstein Transcript finished and linked below. Needs linking and subcategories.
24 24: Kai Lenny - To Play and Flirt with Giants Auto-transcription complete. Ping @Jayg#7232 for access to edit in Descript Needs linking in the transcript section. Transcript section needs subcategories by topic.
23 23: Agnes Callard - Courage, Meta-cognitive detachment and their limits Auto-transcription complete. Ping @Jayg#7232 for access to edit in Descript Needs transcript with linking and subcategories.
22 22: Ben Greenfield - Wheat From Chaff in Human Fitness Transcript finished and linked below Needs linking and subcategories.
21 21: Ashley Mathews (aka Riley Reid) - The mogul and brains behind America's Sweetheart Transcript finished and linked below Needs linking.
20 20: Sir Roger Penrose - Plotting the Twist of Einstein’s Legacy Transcript complete, needs editing and cleaning up. Needs linking in the transcript section. Needs Sponsors section.
19 19: Bret Weinstein - The Prediction and the DISC Transcript complete. Needs linking in the transcript section. Transcript section needs subcategories by topic.
18 18: Slipping the DISC: State of The Portal & Chapter 2020 Transcript generated. Needs linking in the transcript section.
17 17: Anna Khachiyan - Reconstructing The Mystical Feminine From The Ashes Of “The Feminine Mystique” Transcript finished and linked below Finished compiling words, references, and names. Needs linking and subcategories.
16 16: Tyler Cowen - The Revolution Will Not Be Marginalized Transcript generated but not complete, needs much cleaning, formatting, and speaker identification. Needs linking in the transcript section. Transcript section needs subcategories by topic. Needs Sponsors section.
15 15: Garrett Lisi - My Arch-nemesis, Myself Transcript generated but not complete, needs much cleaning and formatting, need machine-readable transcript. Needs linking in the transcript section. Transcript section needs subcategories by topic.
14 14: London Tsai - The Reclusive Dean of The New Escherians Machine-readable transcript complete, need human-readable transcript. Needs transcript with linking and subcategories.
13 13: Garry Kasparov - Avoiding Zugzwang in AI and Politics No transcript. Needs transcript with linking and subcategories.
12 12: Vitalik Buterin - The Ethereal Prince and His Virtual Machine Transcript generated but not complete, needs much cleaning and speaker identification. No machine-readable transcript. Needs linking in the transcript section. Transcript section needs subcategories by topic.
11 11: Sam Harris - Fighting with Friends Transcript complete, needs cleaning up. Needs linking in the transcript section. Transcript section needs subcategories by topic.
10 10: Julie Lindahl: Shaking the poisoned fruit of shame out of the family tree Transcript generated with speaker identification and timestamps. Editing for readability, spelling, and grammar would be helpful. Needs linking and subcategories.
9 9: Bryan Callen - Cracking Wise No transcript. Needs transcript with linking and subcategories.
8 8: Andrew Yang - The Dangerously Different Candidate The Media Wants You To Ignore Transcript complete. Needs linking in the transcript section. Transcript section needs subcategories by topic.
7 7: Bret Easton Ellis - The Dark Laureate of Generation X No transcript. Needs transcript with linking and subcategories.
6 6: Jocko Willink - The Way of the Violent Intellectual Transcript generated with speaker identification and timestamps. Editing for readability, spelling, and grammar would be helpful. Needs linking and subcategories.
5 5: Rabbi Wolpe - “So a Rabbi and an atheist walk into a podcast...” Transcript generated with speaker identification and timestamps. Editing for readability, spelling, and grammar would be helpful. Needs linking and subcategories. Needs Sponsors section.
4 4: Timur Kuran - The Economics of Revolution and Mass Deception Transcript generated with speaker identification and timestamps. Editing for readability, spelling, and grammar would be helpful. Needs linking and subcategories.
3 3: Werner Herzog Transcript complete. Needs linking in the transcript section. Transcript section needs subcategories by topic.
2 2: What Is The Portal? Transcript complete. Needs linking in the transcript section. Transcript section needs subcategories by topic.
1 1: Peter Thiel Transcript complete. Needs linking in the transcript section.
0 0: Welcome to The Portal Transcript complete. Wiki page complete.

The Portal Podcast .vtt Files

# Title File File Status
31 31: to be broadcast - .vtt file needed
30 30: Ross Douthat - The Rave before the Fall - .vtt file needed
29 29: Jamie Metzl - The Bio-Hacker will see you now, Ready or Not - .vtt file needed
Special 1 A Portal Special Presentation- Geometric Unity: A First Look Special 1 VTT File .vtt file needs editing
28 28: Eric Lewis - The Singular Genius of Elew - .vtt file needed
27 27: Daniel Schmachtenberger - On Avoiding Apocalypses - .vtt file needed
26 26: James O’Keefe: What is (and isn't) Journalism in the 21st century Episode 26 VTT File Finished
25 25: The Construct: Jeffrey Epstein Episode 25 VTT File Finished
24 24: Kai Lenny - To Play and Flirt with Giants - .vtt file needed
23 23: Agnes Callard - Courage, Meta-cognitive detachment and their limits - .vtt file needed
22 22: Ben Greenfield - Wheat From Chaff in Human Fitness Episode 22 VTT File Finished
21 21: Ashley Mathews (aka Riley Reid) - The mogul and brains behind America's Sweetheart Episode 21 VTT File Finished
20 20: Sir Roger Penrose - Plotting the Twist of Einstein’s Legacy - .vtt file needed
19 19: Bret Weinstein - The Prediction and the DISC Episode 19 VTT File Finished
18 18: Slipping the DISC: State of The Portal & Chapter 2020 - .vtt file needed
17 17: Anna Khachiyan - Reconstructing The Mystical Feminine From The Ashes Of “The Feminine Mystique” Episode 17 VTT File Finished
16 16: Tyler Cowen - The Revolution Will Not Be Marginalized - .vtt file needed
15 15: Garrett Lisi - My Arch-nemesis, Myself - .vtt file needed
14 14: London Tsai - The Reclusive Dean of The New Escherians - .vtt file needed
13 13: Garry Kasparov - Avoiding Zugzwang in AI and Politics - .vtt file needed
12 12: Vitalik Buterin - The Ethereal Prince and His Virtual Machine - .vtt file needed
11 11: Sam Harris - Fighting with Friends Episode 11 VTT File Speakers and timestamps aligned. May require cleaning and removal of filler words.
10 10: Julie Lindahl: Shaking the poisoned fruit of shame out of the family tree Episode 10 VTT File Speakers and timestamps aligned. May require editing for grammar, punctuation, and spelling.
9 9: Bryan Callen - Cracking Wise - .vtt file needed
8 8: Andrew Yang - The Dangerously Different Candidate The Media Wants You To Ignore Episode 8 VTT File Finished
7 7: Bret Easton Ellis - The Dark Laureate of Generation X - .vtt file needed
6 6: Jocko Willink - The Way of the Violent Intellectual Episode 6 VTT File Speakers and timestamps aligned. May require editing for grammar, punctuation, and spelling.
5 5: Rabbi Wolpe - “So a Rabbi and an atheist walk into a podcast...” Episode 5 VTT File Speakers and timestamps aligned. May require editing for grammar, punctuation, and spelling.
4 4: Timur Kuran - The Economics of Revolution and Mass Deception Episode 4 VTT File Speakers and timestamps aligned. May require editing for grammar, punctuation, and spelling.
3 3: Werner Herzog Episode 3 VTT File Finished
2 2: What Is The Portal? Episode 2 VTT File Finished
1 1: Peter Thiel Episode 1 VTT File Finished
0 0: Welcome to The Portal Episode 0 VTT File Finished

Other Media

Create a table here for other media the community finds relevant or useful.