Voice and Video Outreach

Voice and Video Outreach: Does It Still Outperform Text in 2026?

TL;DR — The 2026 Verdict Personalised video and voice outreach still outperforms text — but only when used on the right 5–15% of your prospect list. The data is real (10–25% reply rates vs. 3.43% for text, per Instantly’s 2026 benchmark), but the labour cost only pays back on high-value targets. What you’ll get below: a Prospect Value Score that tells you when to record video vs. send text, three scripts calibrated for link builders (not SaaS SDRs), and a cost-per-reply calculator so you can model the ROI before you commit to a Vidyard subscription.

The honest answer in one sentence

Yes, video and voice still outperform text in 2026 — but the gap has narrowed, the tooling has fragmented, and the volume math has flipped. If you blindly switch your whole link-building outreach to video, you’ll spend three times more hours for fewer total placements. If you switch only the top tier of your prospect list, you’ll double your reply rate on the targets that matter most.

This article is written for link builders, not SaaS sales reps. The reply-rate studies you’ve seen everywhere come from B2B SDR teams selling £30,000+ ARR products. Link building has a different cost-per-reply tolerance, a different decision-maker profile (editors, not buyers), and a different scale problem (you’re sending 200 pitches a week, not 2,000). The frameworks below are calibrated for that reality.

The Prospect Value Score: When to use video, when to use text

The single biggest mistake link builders make with video outreach is treating it as a default. Video earns its return only on prospects above a value threshold. Below that threshold, plain text wins on a cost-per-placement basis every time.

Score each prospect on the five inputs below before deciding the medium. Each input is scored 1–5. Add them up. Higher total = more reason to record.

The Prospect Value Score (PVS) inputs

Input1–5 scaleWhy it matters
Domain Rating (DR)1: <30 / 2: 30–49 / 3: 50–69 / 4: 70–84 / 5: 85+How much link equity you stand to gain
Topical relevance1: tangential / 3: same vertical / 5: direct competitor or category authorityEditors care more when the pitch fits
Editor seniority1: junior writer / 3: section editor / 5: editor-in-chief or founderSenior editors reward effort signals
Replacement cost of the link1: easy alternative exists / 5: this is the only realistic placementHow much you’d pay to replace this link if you lost it
Personalisation surface area1: stock title page / 5: recent post, podcast, conference talk to referenceYou need something specific to record about

How to read your score

Score bandMediumWhy 
20–25RECORD: 60–90 sec personalised videoTop 5% of list. Use Loom/Vidyard with screen-share on their site. 
15–19RECORD: 30 sec LinkedIn voice noteLower production cost, still personal. Use after connection request. 
10–14TEXT with personalised hookStandard email with one specific reference to their work. 
Below 10SKIP or templateNot worth the touch. Either deprioritise or send a templated email. 
 ⚡ Operational rule of thumb If you’re outreaching to 200 prospects this week, expect roughly 15–25 to score 20+. That’s the entire video budget for the week. Recording video for the other 175 is the most expensive way to discover that DR 35 editors don’t care about your face.

What the 2026 data actually says

Before we get into scripts and tooling, let’s establish what the numbers actually support. Most reply-rate claims you’ll see online are inflated by selection bias — the case studies are from teams who already had warm pipelines, premium tools, and dedicated video producers.

Text vs. video vs. voice: the honest comparison

FormatReply rateSourceCaveat
Plain text cold email (2026 baseline)3.43%Instantly 2026 benchmark reportAll industries averaged
Text email with personalised subject line~5.1%Sendr industry dataLift comes from subject line alone
Personalised video cold email10–16%Multiple platform studiesLoom/Vidyard users; warm-list bias
Deeply personalised video (walkthrough)20–30%Russell Liebowitz case study60+ sec, screen-share, named prospect
LinkedIn voice note (mid-sequence)+30–40% vs textUnkoa case study (Timpl)Solo agencies, niche lists
LinkedIn voice (Letterdrop playbook)~15% lift overallLetterdrop 2026 strategy docCombined with content engagement

Three honest caveats

  • Selection bias is everywhere. The 30% reply-rate case studies almost always involve curated lists of 20–50 prospects, not 500. At link-building scale, expect the middle of the range (10–16%), not the headline number.
  • Open rates are no longer trustworthy. Apple Mail Privacy Protection has been inflating open data since 2022. Optimise for replies, not opens.
  • Link-builder reply rates run lower than SaaS sales. Editors get pitched daily. A 12% reply rate from a UK trade publication editor is excellent; the same number from a SaaS prospect would be average. Calibrate your expectations to your category, not to LinkedIn screenshots.
📊 The number to internalise If your current text-based outreach is hitting 3–5% reply rates, a well-executed video layer on your top 10% of prospects should pull your overall reply rate to 6–8% — a ~70–100% lift. That’s the realistic, repeatable outcome. Anyone promising 25%+ without a curated list is selling tools.

The three scripts that actually work for link builders

Generic sales scripts don’t translate. Editors and webmasters aren’t buyers — they’re publishers. They need a reason to publish, not a reason to convert. Below are three battle-tested formats calibrated for link-building outreach specifically.

Script 1: The 60-second video for guest post pitches (DR 70+ targets)

🎬 Setup Tool: Loom or Vidyard. Open with their site/blog in the background. Record once per prospect — no batching, no AI clones for this tier. Length: 55–70 seconds. Beyond 90s, drop-off destroys reply rate.

The script structure

  1. Seconds 0–8 (Hook): “Hi [name], it’s [your name] from [your site]. I just spent the last twenty minutes reading your piece on [exact article title] — really specific about why it stood out to you.”
  2. Seconds 8–25 (Context): Screen-share their article. Highlight one paragraph. Say something they cannot have heard from another pitcher (“You make the point about X here, but I noticed you didn’t cover Y — I have data on Y”).
  3. Seconds 25–45 (The pitch): Switch to your own draft or a data snippet on-screen. “I’ve put together [specific deliverable]. It updates your X with 2026 data, adds a section on Y, and the angle is [the angle].”
  4. Seconds 45–60 (CTA): “If this looks useful, I’ll send the full draft within 48 hours. Reply to this email with a yes and I’ll get it across. No problem if it’s not a fit — just wanted to put the idea in front of you.”
⚠️ What NOT to do Don’t open with your face on camera. Open with their site visible. Cuts to face only after the hook.Don’t mention your tool stack. Editors don’t care that you use Ahrefs.Don’t ask for the link in the video. Ask for permission to send the draft. Two-step ask = higher reply rate.

Script 2: The 30-second LinkedIn voice note (mid-sequence, DR 50–70 targets)

Voice notes work because LinkedIn engagement signals don’t penalise them, almost nobody else sends them, and the production cost is one-tenth that of video. The catch: they only work after a connection request has been accepted. Cold voice notes get reported.

When to send

  • Day 1: Send blank or one-line connection request
  • Day 2–4: Like or comment on one of their recent posts (genuine engagement)
  • Day 5: Send voice note (the script below)
  • Day 8: Send brief text follow-up referencing the voice note

The 30-second script

🎤 Voice note script “Hi [name] — [your first name] here. Quick voice note rather than a wall of text, because I know your inbox is a war zone.” “I noticed your recent post on [topic] — particularly the point about [specific thing they said]. I run [your site], and we’ve published [related piece] that adds [specific angle] to that conversation.” “Not pitching a link — just wondering if you’d be interested in the dataset behind it. Reply yes and I’ll send the underlying numbers and the methodology. Either way, enjoy the rest of your week.”

The structural trick: the ask is for the dataset, not the link. You’re inviting reciprocity rather than transaction. The link conversation happens after they reply, asking for the data.

Script 3: The voice memo for digital PR follow-ups (any DR)

This one is underused and disproportionately effective. After a journalist or trade-press editor has accepted your initial pitch but gone quiet for 5+ days, a 20-second voice memo in your email follow-up rescues an unusual number of placements.

The 20-second voice memo (recorded in a phone voice-memo app, attached or linked)

🎤 Voice memo script “Hi [name], it’s [your name] from [your site]. I sent over the [data/story/quote] last [day]. No pressure at all — but if it’s helpful, I have a fresh angle I didn’t put in the original email: [one sentence, one new angle].” “Two-second reply works fine — yes, no, or ‘send more’. Cheers.”

Why this works: it acknowledges they’re busy, adds genuine value, asks for the smallest possible reply, and demonstrates that you treat them like a human rather than a placement quota. UK trade-press editors in particular respond well to this — the medium signals respect.

When video actively hurts your link building

Most articles on this topic skip the failure modes. They shouldn’t — failure modes are where most of the budget gets burned. Here are the four scenarios where video reduces your placement rate.

Failure mode 1: Cold first contact

A video from a stranger feels invasive in 2026, particularly in UK and European inboxes where GDPR awareness is high. Reply rates on a first-touch cold video can run lower than plain text. Video earns its lift in turn 2 or 3 of a sequence, not turn 1.

Failure mode 2: High-volume programmatic outreach

If you’re sending 500+ pitches a week for niche edits or broken-link campaigns, video math doesn’t work. At 3 minutes per recording, 500 videos = 25 hours of production time per week. The reply-rate lift never recovers that cost. For these campaigns, stick to text with sharp personalisation hooks.

Failure mode 3: Technical or developer audiences

GitHub maintainers, Stack Overflow contributors, and dev-tooling editors actively dislike video pitches. The audience prefers asynchronous, scannable, code-referenced communication. A 90-second video asking for a link from a dev tools blog will outright kill the relationship before it starts.

Failure mode 4: Low-personalisation surface area

If you can’t find something specific about the prospect to reference — a recent piece, a podcast appearance, a position they hold — you don’t have enough material to record. A video that says “I loved your work” without specifying which work is worse than no video at all. It signals batch production.

🚫 The single biggest mistake AI lip-sync clones for cold pitches. The 2026 wave of tools that promise to clone your voice and personalise videos at scale (Sendr, HeyGen, Pitchlane) work for SaaS sales because the buyer doesn’t care about authenticity — they care about the offer. For link building, where you’re asking an editor to publish you, AI-cloned video reads as deceptive when discovered, and it gets discovered. The risk-to-reward ratio is wrong. Use human-recorded video on tier-1 prospects only.

The 2026 tool stack (with real pricing)

Pricing below is from publicly listed plans as of Q1–Q2 2026. All numbers are USD unless stated.

Tool2026 pricingBest forWeaknessRecommended use
Loom$15/user/mo (Business)Fast recording, universal thumbnail recognitionPersonalisation depth, CRM integrationSolo link builders, low volume
Vidyard$59/user/mo (Pro)Per-recipient analytics, CRM hooks, AI AvatarPrice, learning curveAgency teams, high-value placements
Sendspark$49/mo (Solo)Record-once-personalise-many for cold outreachReliability complaints, AI clone ethicsVolume teams (but ethically tricky for links)
BombBomb$36/user/moStrong email deliverability integrationLess personalisation toolingEmail-heavy sequences
Bonjoro$15/mo (Personal)Mobile-first, trigger-basedLess B2B-focusedSolo agency, mobile workflows
LinkedIn voice (native)Free (mobile only)30%+ lift, no tooling costManual, no automation, mobile-only recordingMid-sequence touches
Phone voice memoFreeLowest production cost, highest authenticityManual upload to emailFollow-up voice memos

The two-tool stack 95% of UK link builders should run

  • Loom (Business plan, $15/mo): for the 60-second video pitches on tier-1 (PVS 20+) prospects.
  • LinkedIn native voice notes (free): for tier-2 (PVS 15–19) prospects, mid-sequence.

Total stack cost: $15/month. If you’re spending more than this on video outreach tooling and you don’t yet have hard reply-rate data justifying the spend, you’re over-tooled.

Cost-per-reply calculator: when does video actually pay back?

Run these three numbers for your own campaign before committing to a video-first strategy.

Inputs you need

  • A: Your current text-based reply rate (e.g., 4%)
  • B: Your hourly rate or your team’s loaded hourly cost (e.g., £40/hr)
  • C: Average production time per personalised video (research + record + send = realistically 6–10 minutes)

The calculation

🧮 Cost-per-reply formula Text cost per reply: (time to send personalised text email ÷ reply rate) × hourly cost Example: (3 min ÷ 0.04) × (£40/60) = £50/reply Video cost per reply: (time per video ÷ reply rate) × hourly cost Example: (8 min ÷ 0.12) × (£40/60) = £44/reply Video wins by ~£6 per reply at these numbers. But notice: if your video reply rate drops from 12% to 8%, the formula flips and text wins by £17/reply. The lift has to be at least 3× your text reply rate to justify the time investment.

The break-even threshold

For a link builder at £40/hour, sending 3-minute personalised text emails at a 4% reply rate, video outreach only pays back when your video reply rate clears roughly 10–11%. Below that, text wins the cost-per-reply battle even if video wins on absolute reply count.

✅ The decision rule Only use video on prospects where you’d happily pay £50–£100 to secure the link. That’s roughly DR 70+ in most categories, or DR 50+ in highly relevant trade verticals. For anything below that bar, text is the rational choice.

UK and GDPR-specific considerations

UK-based link builders pitching UK editors face a tighter regulatory environment than US teams. Three rules worth internalising:

1. Soft opt-in for business addresses

Under UK GDPR and PECR, a personalised cold outreach email to a business email address remains lawful under legitimate interests, provided the message is genuinely relevant to the recipient’s role and offers a clear opt-out. Video and voice attachments don’t change this calculus — they’re treated the same as text under PECR.

2. Voice-clone disclosure

If you use an AI voice clone (Sendr, ElevenLabs-based workflows), UK consumer law and emerging AI regulation increasingly favour disclosure. For link-building outreach specifically — where the relationship is the asset — the reputational risk of being caught using cloned voice on an editor without disclosure is genuinely high. The safer position in 2026: don’t use AI voice clones for outreach where the recipient assumes they’re hearing you.

3. LinkedIn voice notes are still the safest channel

Voice notes sent through LinkedIn after a mutual connection sit on the cleanest legal footing of any video/voice outreach format. The connection itself functions as soft consent; the platform has clear rules; nothing leaves LinkedIn’s environment. For UK link builders, this is the default channel for any voice outreach that isn’t going to a tier-1 video target.

A worked example: how to apply this in a real week

Take a typical week: you’re running outreach for a mid-sized UK SaaS client, 180 prospects identified, mix of DR levels.

Step 1: Score every prospect

Run all 180 through the Prospect Value Score (5–10 minutes total if you’ve built it as a column in your prospect spreadsheet).

Step 2: Triage by score band

Score bandProspectsMediumTime costRealistic outcome
20–251260-sec personalised Loom video~120 mins productionExpected 2–3 placements
15–192830-sec LinkedIn voice (mid-seq)~70 mins recordingExpected 3–4 placements
10–1476Text email, personalised hook~230 mins writingExpected 3–4 placements
Below 1064Template or skip~30 mins send timeExpected 1–2 placements

Step 3: The total math

Roughly 7.5 hours of production time across the week, distributed by prospect value. Expected total placements: 9–13. The same time spent recording video for every prospect would have produced ~18 hours of work, lower team morale, and probably 4–7 placements as the bottom-tier recordings landed flat.

The framework saves ~10 hours per week and roughly doubles output per hour. Over a year, that’s about 500 hours of reclaimed time — enough to run an entire second campaign.

Frequently asked questions

Does video outreach still outperform text in 2026?

Yes, on the right segment of your prospect list. Personalised video achieves 10–16% reply rates versus 3.43% for plain-text cold email (per Instantly’s 2026 benchmark). But the lift only justifies the production time on high-value targets — typically DR 70+ prospects or highly relevant trade verticals. On the bottom 70% of a typical link-building prospect list, text remains the more cost-effective choice.

What’s the best video outreach tool for link builders?

Loom is the default recommendation for solo link builders and small agencies — $15/month, universal thumbnail recognition, and the most reliable recording experience. Vidyard ($59/user/month) is worth the upgrade for agency teams who need per-recipient analytics and CRM integration. Avoid AI clone tools (Sendr, HeyGen) for link-building outreach specifically — the deception risk outweighs the scale benefit when you’re pitching editors.

How long should a video outreach pitch be?

55–70 seconds for the initial pitch. Beyond 90 seconds, drop-off rates kill reply rates. The structure: 8-second hook with their site on-screen, 17 seconds of specific context referencing their work, 20 seconds of the actual pitch, 15 seconds of a clear two-step CTA. Total ~60 seconds.

Do LinkedIn voice notes work better than email video?

They work differently. LinkedIn voice notes lift reply rates 30–40% over text in mid-sequence touches and cost a tenth of what video production costs. But they only work after a connection is established. For first-touch cold outreach, email video is still the better choice. The optimal stack uses both: video for cold tier-1 prospects via email, voice notes for warm mid-sequence touches on LinkedIn.

Is AI voice cloning safe to use for link-building outreach?

Not recommended. The technology works (tools like Sendr can produce thousands of personalised “videos” from one recording), but for link building specifically the risk profile is wrong. Editors and journalists who discover they were pitched with a cloned voice tend to remember — and the relationship becomes unrecoverable. Save AI cloning for low-relationship channels where the recipient doesn’t care; use human-recorded video for relationship channels where they do.

What reply rate should I realistically expect from video outreach?

If you’re targeting your top 10–15% of prospects with genuinely personalised 60-second videos, expect 10–16% reply rates in most categories. The headline 25–30% rates you’ll see in case studies typically involve curated lists of 20–50 prospects with deep manual personalisation — that’s not link-building scale. For a campaign of 200+ prospects, treat 12% as a strong result and anything above 18% as exceptional.

Should I disclose if I’m using AI-generated voice or video?

Yes. UK consumer law and the evolving regulatory environment around AI-generated media increasingly favour disclosure, and for link-building outreach specifically the relationship cost of being caught without disclosure is high. If you’re using AI personalisation tools, disclose the tooling honestly in your sign-off. The good news: most outreach platforms now have a disclosure field built into templates.

How does video outreach compare to multi-channel sequences?

They’re complementary rather than competing. A modern outreach sequence combining text email, LinkedIn voice, and a strategic video touch typically outperforms any single channel by 50–100%. The cleanest playbook: text email day 1, LinkedIn connection day 3, voice note day 5, video on email follow-up day 8 (only for tier-1 prospects). For more on building these flows, see our guide to multi-channel outreach and the broader link building strategies playbook.

Where to go from here

Video and voice outreach in 2026 isn’t a switch you flip — it’s a layer you add to a strong text foundation. If your text outreach is hitting 3–5% reply rates, video on your top tier should pull total reply rates to 6–8%. If your text outreach is below 2%, fix the text first; video won’t rescue a fundamentally weak prospect list or pitch angle. For the underlying outreach foundations, start with our link building strategies hub, and for the broader 2026 picture see our link building statistics 2026 report.

If you’re sourcing the right tools for the rest of your stack, our best link building tools comparison covers everything that sits alongside Loom and Vidyard in a modern link-builder’s toolkit. And if you’re new to the discipline more broadly, our guide to backlinks is the right place to start.

Leave a Reply

Your email address will not be published. Required fields are marked *

Subject Lines That Get Opened Previous post Subject Lines That Get Opened: A/B Test Data From 50,000 Pitches
Multi-Channel Outreach Next post Multi-Channel Outreach: Email + LinkedIn + Twitter Sequences for Link Building