Link Rot: How to find new sources for busted links

There are a variety of ways to find what you're looking for when you come across a broken link on the interwebs. Here's a few methods i like to use.

Search operators

The first thing you should know is how to use a search engine. Various search engines will attach a special meaning to certain characters and these 'search operators' as they're called can be really helpful. Here's some handy examples that work for Google as well as some other search engines (and no, you shouldn't be using Google directly):

OR : 'OR', or the pipe "|" character, tells the search engine you want to search for this OR that. For example cat|dog will return results containing 'cat' or 'dog', as will cat OR dog.

( ) : Putting words in a group separated by OR or | produces the same result as just described, however you can then add words outside of the group that you always want to see in the results. For example, (red|pink|orange) car will return results that have 'car' in them, as well as either red or pink or orange.

" ": If you wrap a "word" in double quotes, you are telling the search engine that the word is really important. If you wrap multiple words in double quotes, you are telling the search engine to look for pages containing "that exact phrase."

site: : If you want to search only a particular domain, such as 12bytes.org, append site:12bytes.org to your query, or don't include any search terms if you want it to return a list of pages for the domain. You can do the same when preforming an image search if you want to see all the images on a domain. You can also search a TLD (Top-Level Domain) using this operator. For example, to search the entire .gov TLD, just append site:.gov to your query.

-: If you prefix a word with a -hyphen, you are telling the search engine that you are not interested in results containing that word. You can do the same -"with a phrase" also.

cache:: Prefixing a domain with cache:, such as cache:12bytes.org, will return the most recent cached version of a page.

intitle: : If you prefix a word or phase with intitle:, you are telling the search engine that the word or phrase must be contained in the titles of the results.

allintitle: : Words prefixed with allintitle: tells the search engine that all words following this operator must be contained in the titles of the search results.

See this page for more examples.

Searching the archives

One of the simplest methods of finding the original target of a busted link is to copy the link location (right click the link and select 'Copy Link Location') and plug that into one of the web archive services. The two most popular, general archives that i'm aware of are the Internet Archive and Archive.is. The Internet Archive provides options to filter your search results for particular types of content, such as web pages, videos, etc.. In either case, just paste the copied link in the input field they provide and press your Enter key. If the link is 'dirty', cleaning it up may provide better results. For example, let's say the link is something like:

http://example.com/articles/1995/that-dog-dont-hunt?ref=example.com&partner=sombody&utm_source=google

The archive may not return any results for the URL, but it might if you clean it up by removing everything after 'hunt'.

There are also web browser extensions you can install to make accessing the archive services easier. For Firefox i like the View Page Archive & Cache add-on by 'Armin Sebastian'. When you find a dead link, just right-click it and from the 'View Page Archive' context menu you can select to search all of the enabled archives or just a specific one. Even if the page isn't dead you can right-click in the page and retrieve a cached copy or archive the page yourself. Another cool feature of this add-on is that it will place an icon in the address bar if you land on a dead page and you can just search for an archived version from the icon context menu.

Of these two services, the Internet Archive has a far more extensive library, but there's a very annoying caveat with it that defeats the purpose of an archive which is why i much prefer Archive.is. The Internet Archive follows robot.txt directives. I won't go into why i think this is stupid, suffice to say that content that is stored on the Internet Archive can be removed even if it does not break any of their rules.

Dead links and no clues

If all you have is a dead link with no title or description and you can't find a cached copy in one of the archives, you may still be able to find copy of the document somewhere. For example let's say the link is https://example.com/pages/my-monkey-stole-my-car.html. The likely title of the document you're looking for is right in the URL — my-monkey-stole-my-car — and you can plug that into a search engine just as it is, or remove the hyphens and wrap the title in double quotes to perform a phrase search. Also see some of the other examples here.

Dead links with some clues

If you come across a dead link that has a title or description, but isn't cached in an archive, you can use that to perform a search. Just select the title, or a short but unique phrase from the description (which preferably doesn't contain any punctuation), then wrap it in double quotes and perform a phrase search.

Dead internal website links

If you encounter a website that contains a broken link to another page on the same site and you have some information about the document, like a title or excerpt, you can do a domain search to see if a search engine may link to a working copy. For example, let's assume the title of the page we're looking for is 'Why does my kitten hate me?' on the domain 'example.com'. Copy the title, wrap it in double quotes and plug it into a search engine that supports phrase searches, add a space, then append site:example.com. This will tell the search engine to look for results only on example.com. Also see some of the other examples here.

YouTube videos you know exist but can't find

Because there is a remarkable amount of censorship taking place at YouTube, they will sometimes hide sensitive videos from their search results when you use the search engine provided by YouTube. To get around this, use another search engine to perform a domain search as described in the 'Dead internal website links' section.

Deleted videos

In some cases, such as with a link that points to a removed YouTube video, you may not have any information other than the URL itself, not even a page title. Using the YouTube link as an example, https://www.youtube.com/watch?v=abc123xyz, copy youtube.com/watch?v=abc123xyz, wrap it in double quotes and plug that into your preferred search engine. You will often find a forum or blog post somewhere that will provide helpful clues, such as the video title or description which you can use to search for a working copy of the video. And the first place to look for deleted YouTube videos is YouTube! You can also search the Internet Archive as well as other video platforms that are more censorship resistant than YouTube, including Dailymotion, BitChute, DTube, LEEKWire and many others.

Broken links on your own website

I don't know about you, but i have nearly 4,000 links on 12bytes.org as of this writing and many of them point to resources which TPTB (The Powers That [shouldn't] Be) would rather you knew nothing about. As such, many of the resources i link to are taken down and so i have to deal with broken links constantly, many of them deleted YouTube videos. If you run WordPress (self-hosted – i don't know about a wordpress.com site) you will find Broken Link Checker by 'ManageWP' in the WordPress plugin repository and it's job is to constantly scan your site to look for broken links. While it is not a bug-free plugin (the developer is not at all responsive and doesn't seem to fix anything in a timely manner), it is by far the most comprehensive tool of its type that i'm aware of. There are also many external services you could use whether you run WordPress or not.

Content updates, 29-Nov-2018

The following new articles were added:

Ethan Huff and the Green Swastika : Ethan, who writes for Natural News, is an idiot regarding Germany's role in WW2, the same as Mike "The Health Ranger" Adams is, the founder (or so i assume) of NN. He recently 'penned' an article titled The Green Swastika: Environmentalism was a pillar of the Third Reich, the content of which is far more absurd than its absurd title reveals. Ethan gets nearly everything wrong and provably so, and, as usual, i pull fewer than zero punches when addressing such mindless ignorance.

Resources for research : This is just a list of web resources which i've used for researching various stuff that i write about. I though i'd share it. Cause i'm like that.

VPS host review: Pride Tech Design : This website is currently hosted by Pride Tech and i wanted to write a review about their service in order to con everyone into signing up with them so i can start raking in piles of kickback cash from Simba at PTD. Either that, or i've written a reasonably neutral review of possibly one of the best VPS server hosts on this side of the Milky Way that i'm aware of. You decide.

The following articles were updated:

How to access RSS feeds for websites that don't advertise one : Some added information and polishing.

Alternative Search Engines That Respect Your Privacy : unbubble.eu was added to the list and the article underwent the application of some polish (and likely the correction of spellin' errors).

Take the Jewish Holocaust Quiz! : The quiz was handled by a plugin that hasn't been updated in years and so i dropped it and stuck the quiz on a regular page. It has also been updated.

A personal perspective: From Windows to Linux to Windows to Linux to… : I re-titled the article and applied some polish.

Firefox Search Engine Cautions, Recommendations : This article underwent some minor changes and an updated search.json.mozlz4 file for Firefox was uploaded.

Firefox Extensions – My Picks : I dumped all of the legacy (XUL) add-ons, swapped out some add-ons with other ones and added several new add-ons to the additional functionality list.

Articles

Jump to: Investigations | History | Technology | Health | Ramblings | Resources

Investigations

History

Technology

Health

Ramblings

Resources

Resources for research

Following are some of the resources i've discovered that i've found helpful for a wide variety of research projects. If you have any to share, please contact me or leave a comment.

General purpose search engines

  • Good Gopher: "Search the independent media (GoodGopher.com)"
  • Internet Archive: Internet Archive is a non-profit library of millions of free books, movies, software, music, websites, and more.
  • Library Genesis: "Library Genesis or LibGen is a search engine for articles and books on various topics, which allows free access to content that is otherwise paywalled or not digitized elsewhere."
  • searx: "Searx is a metasearch engine, aggregating the results of other search engines while not storing information about its users."

Assassinations

Business

  • Corporation Wiki: "Corporation Wiki exists to provide corporate transparency and historical data on companies."
  • LittleSis: "a grassroots watchdog network connecting the dots between the world's most powerful people and organizations"

Digital archives

  • archive.today: General web archive service
  • Archiveteam: "Archive Team is a loose collective of rogue archivists, programmers, writers and loudmouths dedicated to saving our digital heritage."
  • Google News Archive: Search older newspaper articles
  • Internet Archive: "Internet Archive is a non-profit library of millions of free books, movies, software, music, websites, and more."
  • Internet Sacred Text Archive: "This site is a freely available archive of electronic texts about religion, mythology, legends and folklore, and occult and esoteric topics."
  • List of online newspaper archives (Wikipedia): "a list of online newspaper archives and some magazines and journals, including both free and pay wall blocked digital archives"
  • Michael Collins Piper Report: An archive of podcasts and articles by one of America's leading iconoclastic journalists.
  • PDF Archive: "E-books in PDF format, rare, censored, banned and unknown"
  • Spotlight News Archive: "a listing of many of the hundreds of articles that have appeared in The SPOTLIGHT since we first went on-line in 1997"
  • Unz Review Books Archive: "A Collection of Interesting, Important, and Controversial Perspectives Largely Excluded from the American Mainstream Media"

Government

  • Bilderberg
  • Center for Responsive Politics: "the Center for Responsive Politics is the nation's premier research group tracking money in U.S. politics and its effect on elections and public policy."
  • Cryptome: "Cryptome welcomes documents for publication that are prohibited by governments worldwide, in particular material on freedom of expression, privacy, cryptology, dual-use technologies, national security, intelligence, and secret governance — open, secret and classified documents — but not limited to those."
  • Defrauding America: "40 Years of Recorded Corruption in Government Positions,
    Endemic Cover-Ups, and Great Tragedies"
  • Governmentattic.org: "governmentattic.org provides electronic copies of thousands of interesting Federal Government documents obtained under the Freedom of Information Act."
  • GovTrack.us: "GovTrack.us tracks the United States Congress and helps Americans participate in their national legislature. We publish the status of federal legislation, information about your representative and senators in Congress including voting records, and original research on bills and votes."
  • Ted Gunderson: A former FBI Special Agent, Gunderson published many investigative articles and participated in many interviews about his research work post-FBI. He is well known for his work on the 'FINDERS', a world-wide network of people and organizations involved in child trafficking and exploitation
  • The Black Vault: "Exposing government secrets… one page at a time."
  • The National Archives Catalog: "The National Archives and Records Administration (NARA) is the nation's record keeper."
  • We The People Foundation: "The We The People Foundation for Constitutional Education has been established to fulfill the need for popular education including education, awareness, and knowledge about the Declaration of Independence and every provision of the federal and state constitutions, about the sovereignty of the people whose will the constitutions are designed to express, and about the government they are meant to control through their constitutions."

Health

  • Educate-Yourself: "Educate-Yourself.org is a free educational forum dedicated to the dissemination of accurate information in the use of natural, non-pharmaceutical medicines and alternative healing therapies in the treatment of disease conditions."
  • Open Payments Data: "The Open Payments Search Tool is used to search payments made by drug and medical device companies to physicians and teaching hospitals."

History

  • British Movietone: "British Movietone is arguably the world's greatest newsreel archive, spanning the period 1895 to 1986."
  • History Commons: "The website is a tool for open-content participatory journalism. It allows people to investigate important issues by providing a space where people can collaborate on the documentation of past and current events, as well as the entities associated with those events."
  • Holocaust: See Rescuing Israel: The Holocaust
  • Rare Historical Photos

Legal

  • Find Law: "Learn About the Law features informational articles about a wide variety of legal topics, as well as specific information about subjects such as how to hire an attorney and understanding your state's unique laws."
  • Law Notes: "Through the courts, encouraging the Government to obey the law."
  • Natural-Person: "A non-profit site educating Canadians and Americans
    about the differences between human-beings,
    natural-persons and artificial-persons"
  • United Nations – Universal Declaration of Human Rights: "The Universal Declaration of Human Rights (UDHR) is a milestone document in the history of human rights. Drafted by representatives with different legal and cultural backgrounds from all regions of the world, the Declaration was proclaimed by the United Nations General Assembly in Paris on 10 December 1948 (General Assembly resolution 217 A) as a common standard of achievements for all peoples and all nations. It sets out, for the first time, fundamental human rights to be universally protected and it has been translated into over 500 languages."

Military

  • Air Force History Index: "The United States Air Force Historical Research Agency maintains hundreds of thousands of historically significant U.S. Air Force documents representing some 70 million pages of interest to historians, researchers, and Air Force buffs."
  • Stephen S. Clark Library: "The Clark Library combines our map collection, government information center, and spatial and numeric data services. It is a rich and unique resource, where students and scholars from every discipline–as well as those working across disciplines–can find the materials, tools, and expertise to meet their research needs."

People

  • Notable Names Database: "NNDB is an intelligence aggregator that tracks the activities of people we have determined to be noteworthy, both living and dead."
  • Wikiquote: "Wikiquote is a free online compendium of sourced quotations from notable people"

Science

  • Sci-Hub: "the first pirate website in the world to provide mass and public access to tens of millions of research papers"

Miscellaneous

  • 33 Conspiracy Theories That Turned Out To Be True
  • Crystalinks: "Crystalinks is perhaps the largest, most comprehensive and ambitious metaphysical and science website on the Internet today, a journey taken by millions of visitors each month that is spiritually, mentally and emotionally in tune with its readers' lifestyles, needs, zeitgeist and the transcendental quest of humanity."
  • Deep Web Search Engines
  • Edgar Cayce's Association for Research and Enlightenment: "Edgar Cayce's Association for Research and Enlightenment (A.R.E.) was founded more than 80 years ago to help people to transform their lives. Our goal is to be your resource for body-mind-spirit materials that will help you in your everyday experience to be all that you can be."
  • Family Guardian: "Nonprofit Christian religious ministry dedicated to protecting people and families from extortion, persecution, exploitation,socialism, divorce, crime, and sin. Advocate personal liberty, personal responsibility, constitutional and SMALL and accountable government, sovereignty, and religious faith."
  • From The Wilderness: "Information on Peak Oil, Sustainability, and the events surrounding 9/11"
  • Lines of Credit: Ropes of Bondage: "LINES OF CREDIT: ROPES OF BONDAGE is about the financiers, their fellow conspirators and the plot to destroy Western Christian civilization."
  • Metapedia: "An electronic encyclopedia about culture, art, science, philosophy and politics."
  • The Corbett Report: "The Corbett Report is an independent, listener-supported alternative news source. It operates on the principle of open source intelligence and provides podcasts, interviews, articles and videos about breaking news and important issues from 9/11 Truth and false flag terror to the Big Brother police state, eugenics, geopolitics, the central banking fraud and more."
  • THE PAGE: "This is yer brain – BAD PEOPLE WANNA SINK THEIR HOOKS IN IT"
  • WantToKnow.info: "This website provides a concise, reliable introduction to vital information of which few are aware. We specialize in providing fact-filled news articles and concise summaries of major cover-ups and corruption which impact our lives and world."
  • WHALE: "Mapping & Killing the Matrix with Orgonite & Knowledge"
  • WikiLeaks: "WikiLeaks specializes in the analysis and publication of large datasets of censored or otherwise restricted official materials involving war, spying and corruption."
  • Wikispooks: "Wikispooks is an open licensed, open source encyclopedia of deep politics."

Specialized tools

  • FotoForensics: "FotoForensics provides budding researchers and professional investigators access to cutting-edge tools for digital photo forensics."