My question before was when you do site:washingtonpost.com, none of the
pages are cached. My original finding is the tags below:
<META CONTENT="NO-CACHE" HTTP-EQUIV="CACHE-CONTROL">
<META CONTENT="NO-CACHE" HTTP-EQUIV="PRAGMA">
But then I see a lot of pages in that domain without those metatags and
they're not cached either. It's not the "log in" protection, because
that's done in a JavaScript redirect to the login page depending on
cookie, and I know search engines don't read JavaScript. It's not good
practice, but it definately wouldn't cause the whole site not to be
cached. It might be some "no cache request" but I just don't know
where it's doing it.
The pages in washingtonpost.com are getting some rankings (example: for
keyword "Midtown Bethesda North"). Even though Google isn't caching
those pages, does that mean Google doesn't know anything about what's
on those pages except for the Title and Meta tags? Or does it mean
Google knows what's on those pages (such as keyword focused content,
keywords in h1 tags, etc), but is just hiding the cached version from
users because of some kind of "no cache request" from
washingtonpost.com's programming?
Thanks,
Paul.