<?xml version="1.0" encoding="utf-8"?>
		<rss version="2.0">
		<channel>
		<title>SEO > Programming Project</title>
		<link>http://www.thinknetinc.com/code/</link>
		<description>Programming Project</description>
		<lastBuildDate>Thu, 09 Feb 2012 08:04:56 MST</lastBuildDate>
		<language>en-us</language>
			<item>
			<title>How Search Engines Look at Information</title>
			<link>http://www.thinknetinc.com/code/ItemID=12142483228680</link>
			<guid>http://www.thinknetinc.com/code/ItemID=12142483228680</guid>
			<pubDate>Mon, 23 Jun 2008 12:12:02 MST</pubDate>
			<description>&lt;div style=&quot;border: 1px solid black; margin: 5px; padding: 10px; background: none repeat scroll 0% 50% white; -moz-background-inline-policy: -moz-initial&quot;&gt; This is the third in a series of articles attempting to look at how programs are built and how one would begin to think about building their own search engine. This article is about how information on the web is laid out and how the major search engines must look at the context the creator provides. &lt;/div&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;strong&gt;Part I: &lt;/strong&gt;&lt;a href=&quot;ItemID=12112224765651&quot; title=&quot;Creating a New Program&quot;&gt;&lt;strong&gt;&lt;u&gt;Creating a New Program&lt;/u&gt;&lt;/strong&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;strong&gt;Part II:&lt;/strong&gt; &lt;a href=&quot;ItemID=12112243152231&quot; title=&quot;Beginning The Search Project &amp;ndash; Understanding Short Cuts&quot;&gt;&lt;strong&gt;&lt;u&gt;Beginning The Search Project &amp;ndash; Understanding Short Cuts&lt;/u&gt;&lt;/strong&gt;&lt;/a&gt; &lt;/div&gt;&lt;/div&gt;&lt;p&gt;  At its core a search engine needs to find information locked up in documents, databases or web pages. After finding the information in those documents a search engine needs to deliver the information to someone based on a question. Hopefully the question itself can provide some context to help in delivering the proper result.&lt;br /&gt;&lt;br /&gt;You could look at it as if all this data in the world were flat, it starts off equal and depending on the way someone chooses to organize these bits, the more context they have.  If I place something in a pdf document, or an html document, or an image, I have a different set of organizational context surrounding it.&lt;br /&gt;&lt;br /&gt;A web page has meta tags that help to make sense of it, hypertext links to and fro that help to define it, and html and css that help to provide emphasis to certain parts, and the actual text information. This is at least the way a human could interpret it. So far search engine bots are not likely to interpret the document in the same way a human would. And that is understandable since two different humans would likely interpret the layout and context of the document differently also.&lt;br /&gt;&lt;br /&gt;Pdf's, word documents, and text style documents, may or may not have any formating to provide emphasis, and these tend not to be linked to in the same way as an html document. Although both html documents and pdf's do have url's, so the domain, or site itself may provide some context.&lt;br /&gt;&lt;br /&gt;But information is not really flat, since those individuals that create a document inherently provide context. If we consider a web site, there are two different elements, the site as a whole and the individual document. The site as a whole may be made up of thousands of urls and may or may not be related to a single concept. You would expect the homepage, or root directory files to contain the main concept of the whole domain. This does not include &lt;u&gt;&lt;a href=&quot;ItemID=12104384833949&quot; title=&quot;subdomain&quot;&gt;subdomains&lt;/a&gt;&lt;/u&gt;, which can be related to different computers.&lt;br /&gt;&lt;br /&gt;The basics of any search engine are how to interpret what someone types in. If someone types in &amp;ldquo;car&amp;rdquo;, then what should come up? What if a human were asked that same question, what would they come up with? But it is likely the question would be longer and more specific. To a certain extent people have been trained to speak (type) the way search engines want them to, since not many people in the real world would just say &amp;ldquo;car&amp;rdquo;. But on the web people type in &amp;ldquo;car&amp;rdquo; and other relatively nonsensical phrases. And since this web language has developed, many search engines and site owners understand this and have tailored their titles and results to reflect this development.&lt;/p&gt;</description>
			</item>
			<item>
			<title>Beginning The Search Project – Understanding Short Cuts</title>
			<link>http://www.thinknetinc.com/code/ItemID=12112243152231</link>
			<guid>http://www.thinknetinc.com/code/ItemID=12112243152231</guid>
			<pubDate>Mon, 19 May 2008 12:11:55 MST</pubDate>
			<description>&lt;div style=&quot;border: 1px solid black; margin: 5px; padding: 10px; background: none repeat scroll 0% 50% white; -moz-background-inline-policy: -moz-initial&quot;&gt; This is the second in a series of articles attempting to look at how programs are built and how one would begin to think about building their own search engine. This article is about understanding the nature of taking shortcuts and how the major search engines need these to perform up to current expectations.&lt;/div&gt;   &lt;div style=&quot;padding: 10px&quot;&gt;&lt;strong&gt;Part I: &lt;/strong&gt; &lt;strong&gt;&lt;a href=&quot;ItemID=12112224765651&quot;&gt;&lt;strong&gt;&lt;u&gt;Creating a New Program&lt;/u&gt;&lt;/strong&gt;&lt;/a&gt;&lt;/strong&gt;&lt;/div&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;strong&gt;Part III:&lt;/strong&gt; &lt;strong&gt;&lt;u&gt;&lt;a href=&quot;ItemID=12142483228680&quot; title=&quot;How Search Engines Look at Information&quot;&gt;&lt;strong&gt;How Search Engines Look at Information&lt;/strong&gt;&lt;/a&gt;&lt;/u&gt;&lt;/strong&gt; &lt;/div&gt;&lt;div style=&quot;padding: 10px&quot;&gt; The first thing you want to do is determine what you want to accomplish. If I were starting my own search engine then I first want to think about what exists and why. What don't I like about Google or Yahoo! search or MSN search.&lt;br /&gt;&lt;br /&gt;In regards to Google and the other major search engines, everyone dislikes bad results. But you have to determine what is a bad result. If you are searching for contemporary information then for the most part the results are easy to find. This is especially true with Google's search engine, since they frequently source news collections sites, such as Digg. And it is a point like this that needs to be highlighted. This is one of many shortcuts that Google takes to have better results. If there were no public relation sites, such as openpr.com or news aggregators, such as digg.com then Google would have a tougher time getting those results to the public. They would have to work much harder at understanding what is todays news and what is yesterday's news. And yesterday's news is just history, locked in documents called web pages.&lt;br /&gt;&lt;br /&gt;So news sites, public relation sites and so on are shortcuts that Google has capitalized on to make sure their results are better, then if they were simply relying on their own algorithms. Another of these shortcuts are separate programs to interpret certain types of search queries. Let's use the example:&lt;br /&gt;&lt;br /&gt;&amp;ldquo;grams in a pound&amp;rdquo;&lt;br /&gt;&lt;br /&gt;The first thing you will notice is that they get the right answer. The next thing you should realize is that they had to write a separate program to deal with that type of language. They provide their traditional results, but also the answer to your question.&lt;br /&gt;&lt;br /&gt;But if you try a little different search, say &amp;ldquo;1 pound is x grams&amp;rdquo;, they do not translate this into their program. You just receive the traditional results - and this is a matter of differentiating language, something which at this point needs to be very hard-coded.&lt;br /&gt;&lt;br /&gt;So lets say that:&lt;br /&gt;&lt;br /&gt;1.We want to produce good results&lt;br /&gt;2.We don't want to write separate programs specific to certain topics&lt;br /&gt;&lt;br /&gt;But we must understand the nature of shortcuts, and since the web changes (i.e., all sorts of people out there are adding all sorts of new structures all the time), the search engine's role must adapt. I suppose the ideal situation is some all-encompassing artificial intelligence that would adapt on its own, understanding that the web changes and that people are changing it. But for now lets further understand the use of shortcuts and how to make them work better. &lt;/div&gt;</description>
			</item>
			<item>
			<title>Creating a New Program</title>
			<link>http://www.thinknetinc.com/code/ItemID=12112224765651</link>
			<guid>http://www.thinknetinc.com/code/ItemID=12112224765651</guid>
			<pubDate>Mon, 19 May 2008 11:41:16 MST</pubDate>
			<description>&lt;div style=&quot;border: 1px solid black; margin: 5px; padding: 10px; background: none repeat scroll 0% 50% white; -moz-background-inline-policy: -moz-initial&quot;&gt; This is the first in a series of articles attempting to look at how programs are built and how one would begin to think about building their own search engine. &lt;/div&gt; &lt;div style=&quot;padding: 10px&quot;&gt; If you have the opportunity to build a program, you will notice that there are few resources to help you think about the overall structure. There are plenty of web resources for specific issues, but not too many when you want to think about the overall idea of how things should fit.&lt;br /&gt;&lt;br /&gt;For web program ideas there are many open-source programs you can open up and look at &amp;ndash; oscommerce, pligg, wordpress...etc but in some cases you are starting from scratch and these open-source programs are rather all over the place. A lot of companies use these open-source programs and end up hiring professionals to do any modifications, even just color changes. Anyway, open-source always has the downside of being a little too understandable to just about anyone with a little knowledge, and that may not be a good thing for some applications. So you could modify any of these open-source programs for your own purpose, but that may just drive you in the wrong direction.&lt;br /&gt;&lt;br /&gt;Some projects require SEO, and some programs do not. SEO may also steer you away from some open-source applications, since you might want to be innovative. Some programs are needed because they are for a job which may have strict requirements. But there are some projects that you just do for yourself.&lt;br /&gt;&lt;br /&gt;Perhaps, if you are like me you want to make something that you need for yourself, or maybe you are just interested in creating something new and interesting. If you are the type that likes to get into new projects, which may or may not result in a business, then you are generally starting from scratch.&lt;br /&gt;&lt;br /&gt;There are many programs that need creating, or improving, whether they are just something small or if they are something large. Many people have been thinking about, or are currently  in the midst of creating their own search engine, simply because they do not like the way the current crop of companies deal with information. But when you think about creating your own program you may come with all sorts of opinions, but soon you will face a reality.&lt;br /&gt;&lt;br /&gt;When you are thinking about the overall structure of a project you should realize that the thinking part will consume a great deal of time. Why are you doing it? What exactly do you want to accomplish? So the philosophy is a major part, and that can take months, then a little programming, then some more thinking.  &lt;/div&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;strong&gt;Part II: &lt;/strong&gt;&lt;strong&gt;&lt;a href=&quot;ItemID=12112224765651&quot;&gt;&lt;/a&gt;&lt;u&gt;&lt;a href=&quot;http://thinknetinc.com/code/ItemID=12112243152231&quot;&gt;&lt;strong&gt;Beginning The Project - Understaning Short Cuts&lt;/strong&gt;&lt;/a&gt;&lt;/u&gt;&lt;/strong&gt;&lt;/div&gt;&lt;div style=&quot;padding: 10px&quot;&gt;&lt;strong&gt;Part III:&lt;/strong&gt; &lt;strong&gt;&lt;u&gt;&lt;a href=&quot;ItemID=12142483228680&quot; title=&quot;How Search Engines Look at Information&quot;&gt;&lt;strong&gt;How Search Engines Look at Information&lt;/strong&gt;&lt;/a&gt;&lt;/u&gt;&lt;/strong&gt; &lt;/div&gt;&lt;/div&gt;</description>
			</item></channel>
		</rss>
