Project Summary

NanoStream was my group's final project for IST 440W (Search Engines). Starting with the open source search engine Nutch, our goal was to build a nanotechnology search engine. Inspired by the design of an engine called eRace, we modified Nutch to accept an XML preferences file. This file contained nanotechnology keywords and ratings that influenced how the Nutch crawler ranked pages. In addition, we developed a script that analyzed commonly searched terms and added them as nanotechnology keywords. We built this because nanotechnology is a rapidly changing field, so our crawler had to adapt to changing conditions. I was responsible for the graphic modifications to Nutch, implementing the modified crawler, and parsing the XML configuration file.
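To make the idea concrete, here is a minimal sketch of how a keyword preferences file could feed into page scoring and how frequent queries could be promoted to keywords. It is written in Python for brevity and is not the project's actual code, which lived inside Nutch. The XML layout, the `rating` attribute, the function names, and the additive boost formula are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical preferences format; the project's real schema is not shown here.
PREFS_XML = """
<preferences>
  <keyword rating="3.0">nanotube</keyword>
  <keyword rating="2.0">quantum dot</keyword>
  <keyword rating="1.5">nanoparticle</keyword>
</preferences>
"""


def load_keywords(xml_text):
    """Parse the preferences XML into a {keyword: rating} dict."""
    root = ET.fromstring(xml_text)
    return {
        kw.text.strip().lower(): float(kw.get("rating"))
        for kw in root.iter("keyword")
    }


def boost_score(base_score, page_text, keywords):
    """Scale a page's base score by the ratings of the keywords it mentions.

    Assumed formula: base * (1 + sum of ratings of matching keywords).
    """
    text = page_text.lower()
    boost = 1.0 + sum(rating for kw, rating in keywords.items() if kw in text)
    return base_score * boost


def promote_frequent_queries(keywords, query_log, min_count=3, default_rating=1.0):
    """Return a copy of keywords with queries seen at least min_count times added."""
    counts = Counter(q.strip().lower() for q in query_log)
    updated = dict(keywords)
    for term, n in counts.items():
        if n >= min_count and term not in updated:
            updated[term] = default_rating
    return updated
```

With these assumptions, a page mentioning "nanotube" has its base score multiplied by 4.0, while an unrelated page keeps its base score. A term such as "graphene" that appears three times in the query log would be added with the default rating of 1.0.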

Download the final report (pdf)
Download the final presentation (ppt)

Core Technologies
  • Nutch
  • Search Engines
  • XML
  • Adobe Photoshop
  • JSP