Android coming soon

The Android app launches in the coming weeks. Drop us a line at [email protected] to be notified at launch.

← Back to blog
May 18, 2026 Transcribe and analyze your thesis interviews with AI cosmonote.ai

You scheduled fifteen interviews this semester. Each one runs forty minutes. You take notes during, but you know full well you’re missing half of what the participant says — you’re nodding, asking the next question, watching the time. And when you finally sit down to write your analysis, you have a stack of audio files and no real way to navigate them.

This is the bottleneck of qualitative research. The interviews themselves are not the hard part. The hard part is everything that comes after: transcribing, cleaning, identifying who said what, and finding the right verbatim quote when your supervisor asks “could you give me a concrete example of how participant 7 framed their resistance to the new system?”

What changes when you record properly

The first thing recording changes is your presence during the interview. You stop scribbling. You actually listen, follow up on what the person just said, ask the unprepared question that turns the conversation into something useful. Your participant feels heard — and people who feel heard talk more openly.

The second thing it changes is what you walk away with. Instead of pages of half-finished notes, you have an audio file. Audio is dense. Forty minutes of audio is forty minutes of data you can come back to as many times as you need, at any level of detail.

The third thing is reproducibility. Six months from now, when you’re writing your discussion and a question comes up about whether two participants really said similar things or whether you imagined the parallel — you can check. You’re not relying on memory anymore.

How transcription used to work

If you’ve done qualitative research the old way, you know the pain. Either you transcribe yourself, which takes roughly four to six times the duration of the audio (so two and a half to four hours of typing per forty-minute interview), or you pay a transcription service at €1-2 per minute and wait several days. By the time you’ve finished transcribing your twelfth interview, you’ve burned weeks and a noticeable chunk of your scholarship.

The shortcuts that students used to take — paraphrasing, transcribing only “interesting” passages, relying on memory — are exactly the things your jury can call out at the defense. “How do you know you didn’t introduce confirmation bias when you selected which passages to transcribe?” It’s a hard question to answer.

What AI transcription gives you

Cosmonote transcribes a forty-minute interview in roughly forty seconds, with about 99% accuracy on most accents and a clear distinction between speakers. You drop your phone on the table, hit record, and forty minutes later your transcript is on your phone before you’ve put the recorder away.

What you get isn’t just a wall of text. You get speakers labeled — [A], [B], sometimes [C] if a colleague was in the room. You get a thematic summary that highlights what the participant kept returning to. You get a list of decisions or commitments they made, if any. And you get the full word-for-word verbatim, ready to be copy-pasted into NVivo, ATLAS.ti, MAXQDA or just a Word document.

A workflow that actually scales to fifteen interviews

Here’s the workflow that works once you’ve done this a few times:

Before the interview, you set your phone on Do Not Disturb and place it about half a meter from the participant. Cosmonote captures normal conversation fine even in a slightly noisy café, but a quiet room is always better. You don’t need an external microphone.

During the interview, you do nothing different from what you’d normally do. You don’t need to introduce speakers verbally — the diarization will handle that. You can take a few notes if it helps you keep track of follow-up questions, but you’re not relying on them.

After the interview, you tag the recording in Cosmonote with the participant pseudonym and the date. The transcript and summary are ready in a minute or two. You skim the summary to remind yourself of the texture of the conversation while it’s fresh.

When it’s time to analyze, you don’t open the audio at all. You read the transcript, code it in your tool of choice, and use the Ask AI feature when you need to find a specific passage across all your interviews — “Which participants mentioned trust issues with the new tool?” returns the exact quotes from the right interviews.

What about confidentiality

Qualitative research interviews are sensitive by definition. Participants share things they wouldn’t share publicly. Two things matter here.

First, the legal side: under GDPR, you need informed consent before recording. This is something your research ethics committee will have prepared a form for — use it. Tell the participant clearly that you’re recording, what the recording will be used for, how long you’ll keep it, and how they can withdraw. Most people will agree without hesitation, especially in academic contexts.

Second, the technical side: Cosmonote stores audio and transcripts encrypted at rest and in transit, on servers in Paris, France. Only your account can access them. If you delete a note, the underlying audio and transcript are deleted too. This is the same setup the GDPR expects for sensitive data processing.

If your research involves particularly sensitive topics (clinical interviews, victims of harassment, minors), check with your ethics committee — they may require specific additional safeguards.

Exporting for analysis

Once your transcripts are ready, you’ll want to bring them into your analysis tool. From Cosmonote you can copy the full transcript as plain text, share it as a file, or send it via Apple Mail to yourself. In NVivo and ATLAS.ti, importing a plain text file with speaker labels is straightforward — both tools parse [A] / [B] markers as speakers, and you can start coding immediately.

If you prefer to work directly in Word or Google Docs (which is fine for smaller corpora), you can paste the transcript and use the document’s comment feature to start your initial coding pass before deciding whether you need a dedicated CAQDAS tool.

A few practical lessons from people who’ve done this

Number the participants from the moment you schedule them, not after the interview. P01 through P15. It’s a small thing that prevents huge headaches when you cross-reference quotes six months later.

Date every recording in the filename — Cosmonote does this automatically, but if you export and rename, keep the convention YYYY-MM-DD_PXX_topic.txt. Future you will be grateful.

Don’t try to code your first interview during the same week you record it. You’ll over-fit your coding scheme to that one conversation. Wait until you have three or four interviews, then build your scheme from the patterns you see across them.

And finally: the transcription is the floor, not the ceiling. AI gets you to a clean, navigable transcript faster than any other method available today. But the analysis — the interpretation, the themes, the argument you’re going to defend — that’s still your work. The point of using a tool like Cosmonote is to free up the days you’d have spent typing so you can spend them thinking.