Natural language text and image classifiers: Train your feeds with plain English

samuelclay · April 2, 2026, 8:07pm

The Intelligence Trainer has always worked with exact matches. You type a keyword, a tag, an author name, and NewsBlur filters on that literal string. Regex mode added flexibility, but you still need to know exactly what to type. If you want to hide clickbait, you’d have to enumerate every clickbait pattern you can think of. If you want to focus on stories about local government accountability, good luck expressing that as a regex.

Natural language classifiers let you describe what you want in plain English. Instead of matching keywords, NewsBlur sends your description and each story to an AI model that understands what you mean. Write “stories about practical cooking techniques, not restaurant reviews” and it just works. Write “product launch announcements” and it finds them regardless of how each site phrases it. And with image classifiers, you can filter on what’s actually shown in a story’s photos, not just what’s written in the text.

Text classifiers

Open the Intelligence Trainer on any feed and you’ll see a new section: Natural Language Text Classifier. Type a description of what you want to focus on or hide, and press Enter. Not sure if your prompt will catch the right stories? Click Test on this story to see how the classifier would score it before you save, so you can refine your wording.

Your prompt is saved as a classifier pill, just like title and author classifiers. Toggle it between focus (green) and hidden (red) to control whether matching stories are promoted or suppressed. You can add multiple prompts per feed, and each one works independently.

When you save a new prompt, NewsBlur immediately classifies your recent stories against it. Within a few seconds, you’ll see stories re-sort as the classifications come in. From then on, every new story is classified as it arrives.

The classifier sends each story’s title and content to the AI model along with your description. The model decides whether the story matches, doesn’t match, or is clearly the opposite of what you described. That three-way classification means a “focus” prompt can also actively hide stories that are the antithesis of your interest.
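
One way to picture the three-way result is as a small mapping from the model's verdict to a story score. This is only a sketch, not NewsBlur's code: the names `Verdict` and `story_score`, the +1/0/-1 scores, and the choice to leave "opposite" neutral for hidden pills are all assumptions made for illustration.

```python
from enum import Enum


class Verdict(Enum):
    """Hypothetical labels for the three outcomes described above."""
    MATCH = "match"
    NO_MATCH = "no_match"
    OPPOSITE = "opposite"


def story_score(verdict: Verdict, pill: str) -> int:
    """Return +1 (promote), 0 (neutral), or -1 (suppress) for one classifier.

    `pill` is "focus" or "hidden", mirroring the green/red toggle.
    """
    if pill not in ("focus", "hidden"):
        raise ValueError(f"unknown pill state: {pill!r}")
    if verdict is Verdict.MATCH:
        # A match promotes under a focus pill and suppresses under a hidden pill.
        return 1 if pill == "focus" else -1
    if verdict is Verdict.OPPOSITE and pill == "focus":
        # A focus prompt can also hide stories that are its antithesis.
        return -1
    # Assumption: everything else, including "opposite" on a hidden pill, is neutral.
    return 0


print(story_score(Verdict.OPPOSITE, "focus"))
```

Under these assumptions, a focus prompt like “practical cooking techniques, not restaurant reviews” would promote a technique walkthrough, ignore an unrelated story, and suppress a restaurant review.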

When a text classifier matches, you’ll see a colored pill in the story header showing which prompt matched.

Image classifiers

Image classifiers work the same way, but they look at the photos in each story instead of the text. Describe what you want to see (or hide) visually, and the AI model examines each image to decide if it matches.

Some examples:

“Charts and data visualizations” — Focus on stories with graphs, tables, or infographics
“Screenshots of user interfaces” — Find product announcements that include actual UI screenshots
“Nature and landscape photography” — Surface photography posts in mixed-content feeds
“Memes” — Hide (or focus on) image macros and memes

The image classifier is strict about what counts as a match. It only triggers when the subject is literally visible as the main focus of the image, not when something is vaguely related or appears in a logo or watermark. “Food photos” matches a photo of a plate of pasta, not a restaurant storefront.

Image classifier matches also show as pills in the story header, just like text classifiers.

Combining with classifier notifications

Natural language classifiers become even more powerful when paired with per-classifier notifications. Set up a text classifier like “breaking news about AI regulation” on a high-volume news feed, then attach a notification to that classifier pill. You’ll get a push notification or email only when a story matches your natural language description, not every time the feed publishes.

This turns NewsBlur into a semantic alert system. Instead of monitoring keywords, you’re monitoring concepts. A classifier for “security vulnerabilities in open source libraries” will catch stories whether they say “CVE,” “zero-day,” “supply chain attack,” or any other phrasing. Add a notification and you have a monitoring pipeline that understands what you care about.

Scoping

Like all classifiers in NewsBlur, natural language classifiers support three scope levels:

Per site — Applies only to the feed you’re training (default)
Per folder — Applies to every feed in the folder
Global — Applies to every feed you subscribe to

A global text classifier like “sponsored content” can hide promotional stories across your entire feed list with a single prompt.
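
The three scope levels can be sketched as a simple filter that decides which prompts apply to a given story. The `PromptClassifier` type, its field names, and the example feed and folder names are hypothetical; NewsBlur's actual data model may look quite different.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PromptClassifier:
    """Hypothetical record for one natural language classifier."""
    prompt: str
    scope: str                    # "site", "folder", or "global"
    target: Optional[str] = None  # feed name for "site", folder name for "folder"


def applies_to(classifier: PromptClassifier, feed: str, folder: str) -> bool:
    """Decide whether a classifier covers a story from `feed` inside `folder`."""
    if classifier.scope == "global":
        return True
    if classifier.scope == "folder":
        return classifier.target == folder
    if classifier.scope == "site":
        return classifier.target == feed
    raise ValueError(f"unknown scope: {classifier.scope!r}")


def classifiers_for(classifiers: List[PromptClassifier], feed: str, folder: str) -> List[PromptClassifier]:
    """Return every classifier that should run on stories from this feed."""
    return [c for c in classifiers if applies_to(c, feed, folder)]


mine = [
    PromptClassifier("sponsored content", "global"),
    PromptClassifier("practical cooking techniques", "folder", "Food"),
    PromptClassifier("charts and data visualizations", "site", "Data Blog"),
]
print([c.prompt for c in classifiers_for(mine, feed="Data Blog", folder="Tech")])
```

In this sketch a story from "Data Blog" in the "Tech" folder is checked against the global prompt and its own per-site prompt, but not against the prompt scoped to the "Food" folder.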

Usage-based billing

Natural language classifiers use AI models to evaluate every story, which means there’s a real cost per classification. Rather than bundling this into a fixed subscription tier, NewsBlur uses usage-based billing so you only pay for what you use.

Text classifications cost roughly a tenth of a cent per story. Image classifications cost more because they process image data, roughly half a cent per story. The actual cost depends on story length and image count, but for a typical feed publishing 30 stories a month with one text classifier, you’d pay about 3 cents a month (30 stories at roughly a tenth of a cent each).
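
For a back-of-the-envelope estimate on your own feeds, the rough per-story rates above can be plugged into a few lines of Python. This is a sketch, not NewsBlur's billing code: the function name is made up, and it assumes cost scales linearly with the number of classifiers and ignores story length and image count, so real charges will differ.

```python
from decimal import Decimal

# Approximate per-story rates quoted in the post, in cents.
TEXT_CENTS_PER_STORY = Decimal("0.1")   # roughly a tenth of a cent
IMAGE_CENTS_PER_STORY = Decimal("0.5")  # roughly half a cent


def estimate_monthly_cents(stories: int, text_classifiers: int = 0, image_classifiers: int = 0) -> Decimal:
    """Hypothetical estimate: assumes every classifier evaluates every story once."""
    if min(stories, text_classifiers, image_classifiers) < 0:
        raise ValueError("counts must be non-negative")
    text_cost = stories * text_classifiers * TEXT_CENTS_PER_STORY
    image_cost = stories * image_classifiers * IMAGE_CENTS_PER_STORY
    return text_cost + image_cost


# A feed with 200 stories a month, one text classifier and one image classifier.
print(estimate_monthly_cents(200, text_classifiers=1, image_classifiers=1))
```

With those inputs the estimate is 120 cents: 20 cents for text and 100 cents for images, which shows how quickly image classifiers dominate the bill on a busy feed.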

You can set a monthly spending limit to cap your costs. If you hit the limit, classification pauses until the next billing cycle. Your existing cached results still show, but new stories won’t be classified until the limit resets.
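
The pause-at-the-limit behavior can be sketched as a budget check in front of each classification. Everything here is illustrative: the `MonthlyBudget` class, the `classify` helper, the unit (tenths of a cent), and the rule that a charge is refused when it would push spending past the cap are assumptions, and the placeholder verdict stands in for a real model call.

```python
from typing import Dict, Optional


class MonthlyBudget:
    """Hypothetical spending cap, tracked in tenths of a cent to avoid float rounding."""

    def __init__(self, limit: int) -> None:
        self.limit = limit
        self.spent = 0

    def try_charge(self, cost: int) -> bool:
        """Record the charge and return True, or return False if it would exceed the cap."""
        if self.spent + cost > self.limit:
            return False
        self.spent += cost
        return True

    def reset(self) -> None:
        """Start a new billing cycle."""
        self.spent = 0


def classify(story_id: str, budget: MonthlyBudget, cache: Dict[str, str], cost: int = 1) -> Optional[str]:
    """Return a cached verdict if present; otherwise classify only if the budget allows."""
    if story_id in cache:
        return cache[story_id]   # cached results keep showing after the cap is hit
    if not budget.try_charge(cost):
        return None              # paused until the next billing cycle
    cache[story_id] = "match"    # placeholder for a real model call
    return cache[story_id]


budget, cache = MonthlyBudget(limit=2), {}
print([classify(s, budget, cache) for s in ("a", "b", "c", "a")])
```

In this sketch the third story goes unclassified once the cap is reached, the first story still returns its cached result, and calling `budget.reset()` lets classification resume.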

Availability

Natural language text and image classifiers are available now on the web for Premium subscribers with usage-based billing enabled. Enable it from Manage > Account to get started.

If you have feedback or ideas for improvements, please share them on the NewsBlur forum.

This is a companion discussion topic for the original entry at https://blog.newsblur.com/2026/04/02/natural-language-text-and-image-classifiers/

MitchWagner · April 2, 2026, 9:01pm

This looks potentially extremely useful. Thank you for implementing it.

In theory, I should be able to point Claude at my Readwise Reader account, ask it to craft a prompt based on what I’ve stored in there, and then use that to create a global natural language text classifier to train my feeds to surface items likely to interest me. Would that work?

anon-37641789236 · April 2, 2026, 9:33pm

Lots of possibilities! I guess my question is similar to or a restatement of the one above: could one set up just 1-2 very detailed (long) classifiers? Is there a limit on how big that chunk of text (“screenshots of UIs”) can be?

samuelclay · April 2, 2026, 10:30pm

I didn’t add a limit so it could get pretty big! It’s basically a prompt, so try it out and we’ll adjust as it runs.

samuelclay · April 2, 2026, 10:31pm

I like that idea of creating a custom prompt based on an AI agent’s reasoning traces on a corpus of text. If we need to expand the size of the text field, I’d be happy to do that. Try it out first and see if it works for you. I may have to adjust pricing estimates if the difference is more than negligible.

AvocadoToast · April 2, 2026, 11:52pm

Hello, I’d like to make a request as a reader who doesn’t want to engage with any LLMs or generative AI: could you perhaps start using a tag in the NewsBlur blog for posts talking about LLM-based features?

I’m subscribed to NewsBlur’s update blog, but the last few posts were all LLM-focused, and I would really appreciate being able to filter these out.

samuelclay · April 3, 2026, 12:22am

If only there were some mechanism by which you could filter out stories that don’t have explicit tags…

AvocadoToast · April 3, 2026, 11:00am

But see, that’s the point: it’s a bit silly to have to use an LLM to not see LLM-related content.

MitchWagner · April 3, 2026, 6:00pm

Both Claude and Gemini fell down on the task of creating a custom prompt based on Readwise and my personal blog. Or rather, they did create prompts, but the prompts looked unlikely to find articles that actually interest me.

However, I turned Claude loose on my professional writing and it did pretty well at crafting a Google News query (actually several), which I will adapt for classifiers when I have time.

I will also try out natural language classifiers to see if they can be configured to dislike articles written in Spanish as well as articles about sports and gaming. I do not speak or read Spanish (though really I should learn) and I have no interest in sports or gaming.