UNIT-4
How google reads our pages?
Google gets information from many different sources, including:
Web pages,
User-submitted content such as Google My Business and Maps user submissions,
Book scanning,
Public databases on the Internet,
and many other sources.
However, this page focuses on web pages.
Google follows three basic steps to generate results from web pages:
Crawling
The first step is finding out what pages exist on the web. There isn't a central registry of all web
pages, so Google must constantly search for new pages and add them to its list of known pages.
Some pages are known because Google has already visited them before. Other pages are
discovered when Google follows a link from a known page to a new page. Still other pages are
discovered when a website owner submits a list of pages for Google to crawl. If you're using a
managed web host, such as Wix or Blogger, they might tell Google to crawl any updated or new
pages that you make.
Once Google discovers a page URL, it visits, or crawls, the page to find out what's on it. Google
renders the page and analyzes both the text and non-text content and overall visual layout to
decide where it should appear in Search results. The better that Google can understand your site,
the better we can match it to people who are looking for your content.
Indexing
After a page is discovered, Google tries to understand what the page is about. This process is
called indexing. Google analyzes the content of the page, catalogs images and video files
embedded on the page, and otherwise tries to understand the page. This information is stored in
the Google index, a huge database stored in many, many (many!) computers.
Serving (and ranking)
When a user types a query, Google tries to find the most relevant answer from its index based on
many factors. Google tries to determine the highest quality answers, and factor in other
considerations that will provide the best user experience and most appropriate answer, by
considering things such as the user's location, language, and device (desktop or phone). For
example, searching for "bicycle repair shops" would show different answers to a user in Paris
than it would to a user in Hong Kong. Google doesn't accept payment to rank pages higher, and
ranking is done programmatically.
How google reads our pages?
Google gets information from many different sources, including:
Web pages,
User-submitted content such as Google My Business and Maps user submissions,
Book scanning,
Public databases on the Internet,
and many other sources.
However, this page focuses on web pages.
Google follows three basic steps to generate results from web pages:
Crawling
The first step is finding out what pages exist on the web. There isn't a central registry of all web
pages, so Google must constantly search for new pages and add them to its list of known pages.
Some pages are known because Google has already visited them before. Other pages are
discovered when Google follows a link from a known page to a new page. Still other pages are
discovered when a website owner submits a list of pages for Google to crawl. If you're using a
managed web host, such as Wix or Blogger, they might tell Google to crawl any updated or new
pages that you make.
Once Google discovers a page URL, it visits, or crawls, the page to find out what's on it. Google
renders the page and analyzes both the text and non-text content and overall visual layout to
decide where it should appear in Search results. The better that Google can understand your site,
the better we can match it to people who are looking for your content.
Indexing
After a page is discovered, Google tries to understand what the page is about. This process is
called indexing. Google analyzes the content of the page, catalogs images and video files
embedded on the page, and otherwise tries to understand the page. This information is stored in
the Google index, a huge database stored in many, many (many!) computers.
Serving (and ranking)
When a user types a query, Google tries to find the most relevant answer from its index based on
many factors. Google tries to determine the highest quality answers, and factor in other
considerations that will provide the best user experience and most appropriate answer, by
considering things such as the user's location, language, and device (desktop or phone). For
example, searching for "bicycle repair shops" would show different answers to a user in Paris
than it would to a user in Hong Kong. Google doesn't accept payment to rank pages higher, and
ranking is done programmatically.