Search Deployment Guidance - Part 3 with notes from the SharePoint UG session
In the previous two posts in this series I showed some ideas and thoughts on how to approach your SharePoint Search deployment. This third post is a wrap up as well as some external resources including training videos previously published on the Search TechCenter site on TechNet.
Read the previous two posts in this series.
SharePoint Search Deployment Guidance (Part 1)
SharePoint Search Deployment Guidance (Part 2)
In my last post I mentioned about the Search indexing component in MOSS. This Search Architecture white paper available from Microsoft (MS Word) provides details on how the Indexing Architecture works. In summary the two core core components of a Search solution are as follows.
The indexing component is capable of crawling and indexing a variety of content sources. It uses protocol handlers to connect and traverse the source systems and restive the system level meta-data. When it identifies content it will invoke known iFilters to read those content. (Example: SharePoint for Search requires the PDF iFilter to retrieve PDF content).
The indexing engine maintains a file-based index, which contains the indexed content and also maintains managed properties also known as the property store or search schema with scope definitions in the search and configuration databases managed by SQL Server.
Queries are initiated when a user enters a search term and the term is passed by the query server to the query engine. The query engine then searches the full-text index for items that contain the searched-for terms. The end user results are supplemented with keywords, best-bets, and managed properties from the search configuration database, managed by SQL Server. Queries are initiated through the Search object model or the Search Web service on Web servers. So as a developer you can also create your own Search implementation using the object model.
The following articles are available for Developers.
Enterprise Search Protocol Handlers
Using Custom Filters with Indexing Services
Programmatically Administering Enterprise Search (From the SharePoint Server 2007 SDK)
Best Practices: Writing SQL Syntax Queries for Relevant Results in Enterprise Search
Evaluating and Customizing Search Relevance in SharePoint Server 2007
Customizing the Enterprise Search User Interface
Building Custom Enterprise Search Web Parts
Building Enterprise Search Queries
Visual How-to guides and screen casts.
Creating Business Data Catalog Entities in SharePoint Server 2007
Refining Business Data Catalog Search with Scopes
Creating Content Sources to Crawl Business Data in SharePoint Server 2007 Enterprise Search
Creating a Custom Search Page and Tabs in the Search Center of SharePoint Server
Creating and Exposing Search Scopes in SharePoint Server 2007 Enterprise Search
Creating Search Queries Programmatically by using the Search Web Service in SharePoint Server 2007
In a server deployment you should also understand the different components that make up your Search solution. Depending on the number of content sources and the size of your organisations the server roles within your Search solution may have significant impact on how you architect a solution to provide a robust search implementation.
The following are server roles that you need to take into consideration.
Within these you can consolidate all of these on a single server but the recommendation is to have these separated or combined to at least two or more servers. For example you can have one server for Index and Query. (Query role is the server that takes the search requests and has a copy of the propagated index). But in a high availability scenario you should always look to splitting these roles. If you physically separate the query server role from the indexer onto one or more servers, then the full-text index is propagated from the indexer to all query servers your farm.
More guidance on this is available on Joel Oleson’s blog. SharePoint Farm Topology for Search Index and Query.
Planning for high availability - SharePoint TechNet Article
](https://www.chandima.net/Blog/Lists/Posts/Attachments/155/searchcomv2_2.jpg) Through community driven development to enhance the Search user experience the following utilities are available through the CodePlex Search Community Toolkit site. One of the most talked about solutions is the Faceted Search solution ( http://www.codeplex.com/FacetedSearch) which provides facet based category search for your standard SharePoint deployment. It should be noted though that you should test these out as part of your deployment well before you implement these.
For administrators who are managing a SharePoint deployment tools such as the Search Crawl log exporter (http://www.codeplex.com/ExportCrawlLog) provides the ability to move and monitor crawl logs to find summary information about crawls.
The Search relevancy tool adds the capability to set higher relevancy for specific results in your organisation. This is handy and the tool will allow you to change the weighting and normalization of Managed properties and some of the system level relevancy settings. (http://www.codeplex.com/searchrelevancy)
With the inclusion of these Search solutions you can also take ownership of the source and build and customise to your requirements.
If you want to learn in-depth about Search the following video training series is available for download.
The following 14 recorded presentations are based on training modules from three-day, in-person training sessions. The presentations provide details about key enterprise search capabilities in Microsoft Office SharePoint Server 2007. Additionally, the presentations provide guidance about how to combine SharePoint products and technologies in order to build, implement, and manage enterprise search solutions.
Module 1: Workshop Overview (http://go.microsoft.com/fwlink/?LinkId=115635&clcid=0x409)
Module 2: Enterprise Search Overview (http://go.microsoft.com/fwlink/?LinkId=115636&clcid=0x409)
Module 3: SharePoint Search 2007 Walk through (http://go.microsoft.com/fwlink/?LinkId=115637&clcid=0x409)
Module 4: Search Architecture and Deployment Scenarios (http://go.microsoft.com/fwlink/?LinkId=115638&clcid=0x409)
Module 5: Crawl and Query Processes (http://go.microsoft.com/fwlink/?LinkId=115639&clcid=0x409)
Module 6: Relevance Ranking (http://go.microsoft.com/fwlink/?LinkId=115640&clcid=0x409)
Module 7: Customizing the End-User Experience (http://go.microsoft.com/fwlink/?LinkId=115641&clcid=0x409)
Module 8: Developing Search Solutions (http://go.microsoft.com/fwlink/?LinkId=115642&clcid=0x409)
Module 9: Business Data Catalog Search (http://go.microsoft.com/fwlink/?LinkId=115643&clcid=0x409)
Module 10: Extensibility and Integration for Search (http://go.microsoft.com/fwlink/?LinkId=115644&clcid=0x409)
Module 11: Search Administration (http://go.microsoft.com/fwlink/?LinkId=115646&clcid=0x409)
Module 12: Security for Search (http://go.microsoft.com/fwlink/?LinkId=115647&clcid=0x409)
Module 13: Performance Scalability and Capacity Planning for Search (http://go.microsoft.com/fwlink/?LinkId=115648&clcid=0x409)
Module 14: Search Operations (http://go.microsoft.com/fwlink/?LinkId=115649&clcid=0x409)
The presentation “Making the Most of Search in SharePoint (Zip 1MB)” which used for the UG is available for download as a PDF file.