With the Action=Files query string selected, the following document schema will be supplied.
New in EasySearchASP.net Version 1.1
The <DuplicateUrls/> node tells you that the same page results were found under a different Url. For example, This can happen with the following URLs.
All three different URL's, but quite possibly the same content. EasySearchASP is now smart enough to recognize that these pages return the same content, and instead of indexing two pages, it simply ads the second URL to the <DuplicateUrls /> node.
It's also pretty common to run the same content under multiple domains. In this situation, if the content that comes back is the same, EasySearchASP will recognize it's duplicate content, and not add a second page. Neither the page name or domain names have to be the same, the content returned is whats examined for duplication.
1: <?xml version="1.0" encoding="utf-8" ?>
2: <Catalog TimeOnServer="2005-04-09T13:03:59.0937500-07:00">
3: <Pages Count="1">
4: <Page>
5: <Url>
6: <![ CDATA[ http://www.easysearchasp.net ]]>
7: </Url>
8: <Title>
9: <![CDATA[ Add a Search Engine to your ASP.NET Site ]]>
10: </Title>
11: <CrawledDate>2005-04-09T12:57:28.0156250-07:00</CrawledDate>
12: <KillDate>2005-04-10T12:57:28.0156250-07:00</KillDate>
13: <DuplicateUrls>
14: <DuplicateUrl>
15: <![CDATA[ http://www.easysearchasp.net/default.aspx ]]>
16: </DuplicateUrl>
17: </DuplicateUrls>
18: </Page>
19: </Pages>
20: </Catalog>