Ethan Lazuk

SEO/GEO & marketing professional.


Hamsterdam Part 61: Weekly SEO & AI News Recap (6/3 to 6/9, 2024)

By Ethan Lazuk

Last updated:

A weekly look-back at SEO & AI news, tips, and other content shared on social media & beyond.

Hamsterdam Part 61 SEO News Recap with John Mueller quote.
Source: John Mueller (LinkedIn)

Opening notes, thoughts, and other musings:

  • Welcome to another week of Hamsterdam!
  • I’m publishing this recap a little later in the day. Hope that’s ok!
  • I greatly appreciate you reading it, though. 🙂
  • There’s a LOT to get to this week. So let’s hit it!

Feel free to jump ahead to the news recap if you’re in a hurry.

If you’d like these delivered every week, subscribe to my free newsletter here.


Marketing word of the week: “Demographics”

Demographics refers to particular subsets within a population that share attributes in statistical data.

Huh?

In the context of marketing, demographics are groups of people who share characteristics, like age, gender, income, or education.

Oh, ok.

They are used to identify a target audience or segments for tailored messaging and can inform buyer personas, channel selection (organic search, social, email, etc.), and content.

Specifically in digital marketing, demographics can determine targeted advertising with Google Ads or paid social, as well as organic strategies based on personalized campaigns.

In the context of SEO, demographics might influence keyword research, content creation, and even link-building (digital PR) outreach. They can also impact page experience decisions (like UX and design).

My 2 cents on demographics are …

I think the way they’re are commonly thought of is anachronistici.e., old school.

I hope to write more Hamsterdam Marketing articles involving social sciences topics soon, but in our first one about personas, we talked about using data and avoiding stereotypes.

Aside from eligibility (for example, “our service is designed for women age 25-45 living in California”), I think demographics should go out the window and instead be based on dominant shared traits, as determined by machine learning models.

These would be things like, “eats plant-based foods” or “enjoys living near water.”

For example, I started watching this free TV app called Tubi.

It didn’t have a user profile for me, so all of the ads were for low-cost products or how to save money. You know, the kind of topics you’d stereotypically expect to appeal to someone who watches a free TV app.

When in actuality, I just wanted to watch full episodes of Dog the Bounty Hunter for nostalgic purposes!

Dog the Bounty Hunter intro.

Maybe they would have served me up some more relevant content with more of my history, but that’s where topics like user interaction data and personalization, including for LLMs, come into play, and where I think the future of search is likewise headed.


AI word of the week: “Decision boundary”

A decision boundary is a separator between classes, either for binary or multi-class classification problems, in a machine learning model.

It helps the model determine which class a new data point belongs to based on its features or characteristics.

Examples of decision boundaries in classification algorithms.
Source

More technically, a decision boundary is called a problem space, where the output label of a classifier is ambiguous, i.e., it’s the dividing line between different predicted classes.

A classifier (algorithm) learns a decision boundary during training and then uses it to make predictions.

For example, if we trained a classifier to say whether a document was “helpful” or “unhelpful” based on labeled examples, it could then predict if a new document was helpful given where it fell relative to the decision boundary, as based on relevant attributes.

If the model learns the training data too well, overfitting can occur, where the boundary becomes highly complex and irregular, reducing its accuracy. The same can be said for underfitting.

Underfitting and Overfitting for decision boundaries.
Source

What if data falls on the decision boundary? It depends!

The model might use random assignment, provide a confidence score, adjust itself, or use a soft margin, where it tolerates some data points being on the wrong side with the goal of a more generalizable model.

When it comes to classification algorithms, decision boundaries are a handy concept to know.


This week in SEO history: “DMOZ” (1998)

On June 5th, 1998, two developers for Sun Microsystems named Rich Skrenta and Bob Truel created a non-commercial and multilingual catalog of websites called the Open Directory Project (or ODP).

It’s better known as DMOZ, named for its original domain: directory.mozilla.org. Here’s a screenshot from 1999 via WayBack Machine:

Open Directory Project on DMOZ domain circa 1999 via WayBack Machine.

The content of the catalog was maintained by a community of volunteers, and AOL owned it for most of its history, until the project ended on March 17th, 2017.

It’s obituary (sort of) was written on Search Engine Land by Danny Sullivan:

“DMOZ — The Open Directory Project that uses human editors to organize websites — is closing. It marks the end of a time when humans, rather than machines, tried to organize the web. …

DMOZ was born in June 1998 as ‘GnuHoo,’ then quickly changed to “NewHoo,” a rival to the Yahoo Directory at the time. Yahoo had faced criticism as being too powerful and too difficult for sites to be listed in.

It was soon acquired by Netscape in November 1998 and renamed the Netscape Open Directory. Later that month, AOL acquired Netscape, giving AOL control of The Open Directory.

Also born that year was Google, which was the start of the end of human curation of websites. …

DMOZ will live on in one unique way — the NOODP meta tag. This was a way for publishers to tell Google and other search engines not to describe their pages using Open Directory descriptions.”

– RIP DMOZ: The Open Directory Project is closing (Danny Sullivan, SEL)

The site today is 400 bad request error.

There’s another website called DMOZTools.net that says it’s “an independently created static mirror of dmoz.org and has no other connection to that site or AOL,” but it appears to be broken.

There’s also ODP.org, which says, “As former directory editors, we were sad to see DMOZ announce their closure on short notice, so we decided to use their RDF file to preserve the work & create a static snapshot of their web directory.” It’s not HTTPS, so I won’t link to it here, but it does appear to work.

The ODP.org About page has some additional history, including DMOZ’s influence on Wikipedia:

“AOL did not state a reason for closing the directory. However the directory (like other general purpose web directories) saw declining influence and usage as user search behavior shifted away from web directories to search engines. This shift was accelerated by the shift to mobile devices, as searching on mobile devices made instant answers a far better user experience than a list of websites. Many major search services like Google & Bing offer instant answers & knowledge panels in their search results which aim to answer frequently asked questions.

While searchers are still interested in a diverse range of content, online usage behavior has concentrated on a few major portal sites like Facebook, Instagram, YouTube, Amazon, eBay, Netflix, Google, Twiter, and TikTok.

The web directory model of listing websites struggled to compete on relevancy with search engines that quickly surfaced the most relevant pages (or even the most relevant answers) from widely trusted websites.

Larry Sanger claimed DMOZ was an inspiration driving the launch of Nupedia, which led to the founding of Wikipedia. Content from Wikipedia is used widely by search engines in instant answers and knowledge graph results. Search engines pay to license the content from the Wikimedia Foundation.”

– ODP.org About page

It’s an interesting history DMOZ has, born as a non-commercial and volunteer-supported project.

Speaking of volunteering and projects …

Let’s get to our introduction this week, talking about hard conversations.


Introduction to week 61: “hard conversations”

Location, location, location.

Buy Land AJ Sopranos GIF.

There’ve been some good local SEO articles published since the Google API Content Warehouse leak (that tiny event mentioned in last week’s intro). 😉

Andrew Shotland wrote a post for Search Engine Land.

Here’s a snippet of it:

“In theory, if you have titles throughout your site that are relevant to the user queries (a.k.a. ‘relevance’) + a strong localityScore (a.k.a. ‘proximity’), you should have a decent shot at good local rankings, assuming you are prominent enough.”

– Andrew Shotland, SEL

That made me think about my own situation.

I’ve run my business since January and also added much of my site’s content since then.

I put my site into Google’s NotebookLM experiment last night as a test, and this was the summary:

NotebookLM summary of my website.

That’s pretty good. As you can see, it says I work with businesses “across the globe.”

Now for Search …

I created a service page for Arabic-based businesses, which basically says I’m an American SEO with cultural knowledge of the Arab world.

I haven’t spent as much time on the page as I’d like, and you can see it ranks for likely location-based queries that don’t apply:

Arabic and Middle East SEO queries in Google Search Console.

On the other hand, a lot of my incoming calls these days are from Central Florida businesses, which makes sense because I’m pretty easy to find in Orlando:

Orlando SEO consultant query on Google showing my location page for that.

I’m happy about that, but I also plan to update that page this week.

Why would I do that if it ranks well?

Well, as I often remind businesses, it’s not about the keyword rankings; it’s about what they get you, business wise.

Which brings me to this week’s topic of “hard conversations.”

I’m fairly new to doing consultations with small businesses. I’ve helped plenty over the years, but that’s after they were qualified and the whole nine.

This scene from The Wire came to mind …

Stringer Bell: “Now, you need to call Mr. Goose, Mr. Faucet, whoever the f*ck you gotta call because I gave you money to run, remember that?”

Clay Davis: “It ain’t like no drսg deal, String. Hell, man, you don’t put your money on the street and have it come right back. It don’t work like that.”

Stringer Bell: “Oh, it don’t, huh?”

Clay Davis: “Patience, my man.”

– The Wire, Season 3, Ep. 9

Ok, bad example, haha …

But seriously, many of these small Florida businesses that I’m speaking with have the same story — they paid someone who wasn’t a knowledgeable SEO professional and got burned in one way or another.

By the time they come to me, they’re fed up and skeptical and like, “Why didn’t this work?!”

It’s never one or two fast and easy things. It’s invariably a collection of micro-improvements that will add up over time.

It’s a marketing strategy …

I don’t mind spending a little effort to put together recommendations these businesses can bring to their teams, whether we work together or not.

That’s just being a decent human being, imo.

But truthfully, it’s still a drop in the bucket.

Even as I was writing this — like literally — a Screaming Frog message popped up to tell me I’m running out of memory on a crawl.

It was for a small business that emailed me. I looked at their site and saw maybe 10 pages.

Screaming Frog found 150,000 URLs …

These are hard conversations to have.

But all we can do every day is wake up, brush our teeth (and floss!), and be honest about our capabilities and true to our ethics.

I just hope more of the rocket ship emoji crowd remembers that. 😉

Buckle up for a full week’s recap, and enjoy the vibes (a Major Tom tribute):

[Cicatriz ESP by The Mars Volta on YouTube. Removed the embed for improved mobile loading, but here’s the link!]

Thank you for supporting Hamsterdam and the cause of SEO & AI learning.

Missed last week? Don’t worry, I got you! Read Part 60 to catch up.

Other great sources of weekly SEO news:


Now, time for our weekly review of SEO social posts, articles, & more …

The Big Lebowski is this your homework Larry scene.

Quick summary

  • Google Search Central joined LinkedIn; more SEOs are joining Mastodon
  • AI Overviews visibility drops; seems to be coming back for me personally
  • Pick of the week: discussion (on YouTube) with Mike King, AJ Kohn, Daniel Foley, and Robin Fishley
  • Sneaky pick of the week: OpenAI’s paper on concepts from features in GPT-4
  • And much more!

Jump to a section of this week’s recap:

Or keep scrolling to see it all.

Ok, time to step inside the white flags of Hamsterdam …

Hamsterdam scene from The Wire with Carver pointing at the white flags.

SEO news, Google updates, SERP tests, or key posts

Notable updates or news related to Google Search or related SEO topics.

Excerpt: “Search is not dead, but is changing in a major way, and as much as SEOs like to make fun of generative results, how we expect to retrieve information and interact with the source of information is starting to shift away from the traditional paradigm to new and better way.”
Excerpt: “Google’s AI Overviews now appear less than 15% of the time, according to a new analysis. Google’s AI Overviews (formerly known as Search Generative Experience when it was an opt-in experiment in Google Labs) at one time appeared on 84% of queries.”
Excerpt: “I do suspect Google will slowly being to show them more often as: (1) They figure out how to get people to click on the ads with them there (2) They get more confident in the results being safe (3) They become cheaper (4) And maybe when the clicks to publishers in the free listings and those AI cards get clicks”
Note: This is my pick of the week. I thoroughly enjoyed hearing AJ’s POV on these topics along with the rest of the panel. More viewpoints shared in context together is a great way to explore information, imo. (I still like talking about E-E-A-T, though, on a conceptual level … don’t @ me … but I think sparse vs. dense vector embeddings and ScaNN are interesting, too 😉 … think like an engineer who’s thinking like a user who’s thinking about Amazon or TikTok. JK.) 😉

SEO tips & tidbits

Actionable tips, cool tidbits, and other findings and observations that can be teaching moments.

Excerpt: “This could be comparable to LinkedIn Collaborative Articles, which surged briefly before a huge drop.”
Excerpt: “The risk that Google could similarly be beaten in relevance by another company is highlighted by a startling conclusion from BERT: huge amounts of user [Redacted] can be largely replaced by unsupervised learning from raw text. That could have heavy implications for Google.”

Note: Great points made by MH here! There’s some relevant discussion to this regarding LLMs (which BERT qualifies as) replacing user interaction data in Google’s post-trial debrief, as well. // Related: In the years prior, Google spoke about integrating neural networks into more systems. Regarding DNNs (called “deep ML” in the email), there’s a TikTok video down below showing the impact of transformer-based architectures on Google’s translation. Studying dense vector embeddings, self-attention, and multi-headed attention in transformers is helpful context. Relatedly, a blog post in the AI section below coincidentally talks about Google’s history of using ML for coding and mentions a similar timeline.

SEO (and AI) fundamentals & resources

Essential information, concepts, or resources to learn about SEO or AI.

Articles, videos, case studies & more

Longer-form content pieces shared on social, in newsletters, and elsewhere.

Excerpt: “Generational gaps in technology use are widening, with Generation Z showing a marked decrease of 25% in Google usage compared to Generation X. This data points to a significant evolution in search habits, as younger users pivot to alternatives that better suit their informational needs and lifestyles. Social media platforms are no longer just for connectivity but have evolved into primary search tools for a quarter of the population. This shift reflects a broader trend where traditional search engines are supplemented, or even replaced, by social network searches.”
Excerpt: “Something that’s not widely understood is that Google Search has been an AI Search engine since at least 2015 with the introduction of RankBrain and other subsequent changes to the backend side of search. The big change in Search this year is that AI is more obvious on the front-end as a Feature in Search, largely replacing the role that Featured Snippets once played. Perhaps more importantly there may have been an infrastructure change at the beginning of 2024.”

Excerpt: “In order for backlinks to be evaluated by Google and to have the greatest possible influence on rankings, various properties of the links must be taken into account.” (Translated.)

Cosmic Rays & Crawlers – Google, Search Off the Record Podcast

[Removed the YouTube embed to improve mobile loading. 😉 View here.]

Technical SEO

Everything from basics to advanced moves (and also tools).

Excerpt: “N-grams are a sequence of consecutive words (or numbers, and symbols) found in text. They enable you to see the words used on a page, their frequency and patterns for various NLP tasks. Using n-grams has various limitations around topic modelling and semantic relevance, but they are a useful tool in SEO for simple text analysis, on-page alignment and even internal linking. … While Google has shifted way beyond simple keyword matching, having the words you want to rank on the page typically still helps in SEO. Keyword density is a myth, so there isn’t a magic density level you should aim for to improve rankings. However, the occurence, frequency and uniqueness of words are useful to analyse as an indiciation of basic relevancy (outside of semantic models).”

Excerpt: “Google explains that some CDNs use quickly expiring URLs for video and thumbnail files and encourages publishers and SEOs to use just one stable URL for each video. Something interesting to note is that not only does this help Google index the files it also helps Google collect user interest signals.”

Content marketing

From what is helpful content to user journeys and beyond.

Local SEO

From Google Business Profiles or reviews and more!

Data analysis & reporting

Showing that what you’re doing is helping.

Excerpt: “SEO professionals often grapple with the uncertainty of their strategies’ outcomes and the ever-changing algorithms of search engines. This uncertainty makes it challenging to confidently attribute changes in website traffic or rankings to specific SEO actions. This is where math, statistics, and advanced analytical tools come into play.” (Link to article.)

AI, machine learning, & LLMs

News related to models, papers, and companies.

Excerpt: “Generative text-to-image (T2I) models have unlocked immense capabilities to synthesize high-quality imagery. However, their creativity can lead to the generation of images that contain harmful content. … Advances in safety for T2I models have successfully mitigated harms from many of the more obvious failure modes, such as those where a user explicitly describes a harmful depiction (e.g., an excessively violent image). However, mitigating against less obvious adversarial attacks remains a challenge. We call these implicitly adversarial because they don’t contain explicit adversarial attacks or triggers that can be detected by existing safety filters.”
Excerpt: “Machine learning (ML) has the potential to revolutionize healthcare, from reducing workload and improving efficiency to uncovering novel biomarkers and disease signals. In order to harness these benefits responsibly, researchers employ explainability techniques to understand how ML models make predictions. However, current saliency-based approaches, which highlight important image regions, often fall short of explaining how specific visual changes drive ML decisions. … In ‘Using generative AI to investigate medical imagery models and datasets’, published in The Lancet eBioMedicine, we explored the potential of generative models to enhance our understanding of medical imaging ML models.”

Why it matters: This framework uses generative AI and expert review to identify and interpret the visual cues associated with model predictions, which can help researchers understand how models arrive at their decisions, helpful for medical diagnostics and bias in healthcare.

Notes: There’s a lot of potential with NotebookLM for SEO tasks, especially considering it reads URLs and on-page content quite well. (I plan to add more use cases to my LLMs for SEO post. TBD!)
Excerpt: “With the advent of transformer architectures, we started exploring how to apply LLMs to software development. LLM-based inline code completion is the most popular application of AI applied to software development: it is a natural application of LLM technology to use the code itself as training data. The UX feels natural to developers since word-level autocomplete has been a core feature of IDEs for many years. … In other words, the same amount of characters in the code are now completed with AI-based assistance as are manually typed by developers. “

Why it matters: This discusses AI-based assistance for software engineering, where Google is using ML to improve workflows. It speaks to the importance of UX and the effectiveness of ML-based automation. Of interest to me was the “user interaction logs” that collect data to capture how users interact with features, which is used to improve AI-based features in coding tools (training models on how to better assist users).

Excerpt: “We demonstrate here a dramatic breakdown of function and reasoning capabilities of state-of-the-art models trained at the largest available scales which claim strong function, using a simple, short, conventional common sense problem formulated in concise natural language, easily solvable by humans. The breakdown is dramatic, as models also express strong overconfidence in their wrong solutions, while providing often non-sensical “reasoning”-like explanations akin to confabulations to justify and backup the validity of their clearly failed responses, making them sound plausible.”

Why it matters: I’m working on a post about epistemic vs. aleatoric uncertainty in LLMs, or basically when there’s a hallucination vs. multiple correct answers. More broadly, the researchers in this paper are asking for an “urgent re-assessment of the claimed capabilities of current generation of LLMs.” As SEOs, I think the we can better understand confidence and reliability of answers from not only LLMs but IR systems and factor that into how we think of engineers trying to resolve these challenges.

Extracting Concepts from GPT-4 – Open-AI

Extracting concepts from GPT-4.
Excerpt: “Unlike with most human creations, we don’t really understand the inner workings of neural networks. … neural networks are not designed directly; we instead design the algorithms that train them. The resulting networks are not well understood and cannot be easily decomposed into identifiable parts. This means we cannot reason about AI safety the same way we reason about something like car safety. … Unfortunately, the neural activations inside a language model activate with unpredictable patterns, seemingly representing many concepts simultaneously. … While sparse autoencoder research is exciting, there is a long road ahead with many unresolved challenges. In the short term, we hope the features we’ve found can be practically useful for monitoring and steering language model behaviors and plan to test this in our frontier models.”

Why this matters: THIS IS A COOL TOPIC. 🙂 Features are matching patterns of neuron activations in deep neural networks that can be described as concepts, both abstract and concrete representations of human-understandable entities. For AI researchers, this offers a peek into how LLMs “think,” contributing to improved interpretability and safety. For SEOs, I believe this opens up a whole new frontier of opportunity for semantic-based strategy research. I wrote more about Anthropic releasing similar research about features in Claude. I plan to do the same for this OpenAI paper. In the meantime, here’s their full paper with feature visuals available.

Humor

Subjectively funny content.

Someone will have to explain the above joke to me …

TikTok

It’s a search engine, or so I’m told …

@consequence Kendrick Lamar delivered a surprise speech at Compton College's graduation ceremony on Friday. #kendricklamar #kendrick #compton #comptoncollege #commencement #rap #hiphop ♬ original sound – consequence
@deandocs Meet Marion Stokes. In 1979, Marion began recording TV 24/7, capturing sitcoms, news and everything in between. She was determined to preserve television history that networks were constantly erasing to save money and free up storage space. As a former librarian and activist, Marion was passionate about access to information. She sometimes had as many as eight tapes spinning away at once. Even getting her family in on the act. “We’d be out at dinner and we’d have to rush home to swap tapes” recalled her son Michael. By the time she passed away in 2012, Marion had amassed about 71,000 tapes – the only comprehensive collection preserving this period in television media history. Now, the Internet Archive is digitizing her incredible work to make it publicly available. Marion's vision of universal access to knowledge will finally be realized, thanks to her tireless effort of secretly recording everything for thirty years. #foundmedia #internetarchive #internethistory ♬ original sound – Dean
@the.wall.street.j32 How Google Translate Turns 134 Languages Into Math #google #language #math ♬ original sound – The Wall Street Journal

General marketing & miscellaneous

This is for great content that isn’t necessarily SEO or marketing-specific. PPC, PR, dev, design, and social friends, check it out!

Older stuff that’s good!

Not everything I find worth sharing is new as of this week, so these are gems I came across published in the past.

Great job making it to the end. You rock!

Want help with your SEO strategy?

I’m an independent SEO consultant based in Orlando, Florida, focusing on custom audits and strategies for brands. Don’t hesitate to reach out, or visit my about page for more information.

Let’s connect!

Hit me up anytime via text or call at 813-557-9745 or on social or email:

Cheers!

Editorial history:

Created by Ethan Lazuk on:

Last updated:

Need a hand with a brand audit or marketing strategy?

I’m an independent brand strategist and marketing consultant. Learn about my services or contact me for more information!

Leave a Reply

Discover more from Ethan Lazuk

Subscribe now to keep reading and get access to the full archive.

Continue reading

GDPR Cookie Consent with Real Cookie Banner