chris-castillo-dev · July 28, 2024 21:23 · PropelDMS · Jul 28, 2024
diff --git a/Screaming-Frog-Internal-Links-XPath-Options.txt b/Screaming-Frog-Internal-Links-XPath-Options.txt
 ## Exclude Element with ID (replace div with any tag)
 and not(ancestor::div[@id='MyID'])

 ## Exclude Element with Class
 and not(ancestor::body[contains(@class, 'My-Class')])

 ## Exclude Element with Attribute (replace div or attribute/value as needed)
 and not(ancestor::div[@data-elementor-type='header'])
 and not(ancestor::div[@data-id='981cb'])

 ## Chat GPT Prompt
 Please act like an expert with Screaming Frog web crawler and custom extractions using XPath values. Below is an extraction I configured that looks for links not found in the <header> or <footer> tag:

 //a[not(ancestor::header) and not(ancestor::footer) and not(ancestor::div[@data-elementor-type='header']) and not(contains(@href, '/wp-content/'))][starts-with(@href, '/') or starts-with(@href, './') or starts-with(@href, '../') or starts-with(@href, '#') or contains(@href, 'mydomain.com')]/@href

 Please update this XPath value to also exclude links found ....
	## Exclude Element with ID (replace div with any tag)
	and not(ancestor::div[@id='MyID'])

	## Exclude Element with Class
	and not(ancestor::body[contains(@class, 'My-Class')])

	## Exclude Element with Attribute (replace div or attribute/value as needed)
	and not(ancestor::div[@data-elementor-type='header'])
	and not(ancestor::div[@data-id='981cb'])

	## Chat GPT Prompt
	Please act like an expert with Screaming Frog web crawler and custom extractions using XPath values. Below is an extraction I configured that looks for links not found in the <header> or <footer> tag:

	//a[not(ancestor::header) and not(ancestor::footer) and not(ancestor::div[@data-elementor-type='header']) and not(contains(@href, '/wp-content/'))][starts-with(@href, '/') or starts-with(@href, './') or starts-with(@href, '../') or starts-with(@href, '#') or contains(@href, 'mydomain.com')]/@href

	Please update this XPath value to also exclude links found ....