Activity at WebSci

Last updated: Sun, 21 Jul 2002

I have designed and implemented these sub-projects from scratch:

  • Simple SQL-based search engine. Tasks involved:

    • A generic document parser and indexer.
    • A search query interpreter (supports AND/OR operators, including/excluding words, grouping operators using parentheses).
    • SQL generator for investigating the index database.
  • Enabling the Ensiva project to generate PDF pages from an HTML design. Tasks involved in this project:

    • Breaking paragraphs into lines -- I implemented a variation of the Knuth's line breaking algorithm in a generic library which is currently used from two different projects.
    • Generic PDF generation library.
    • Page layout -- breaking tables across pages, keeping a minimum numbers of rows together on page, repeating headers on all pages, etc.
    • Font handling library -- implemented an AFM parser for retrieving the font information and metrics. This is currently rewritten using FreeType.

Other activities include:

  • Code documenting (my early days at WebSci).
  • Profiling and performance improvements in some areas.
  • Memory leaks detection and fixing.
  • Bug fixing, in the areas that I developed myself as well in areas developed by others.
  • Team/project management.
  • Participated in the PQA builder project (Web Clipping Applications for Palm-powered devices).
  • Ported code to Red Hat 7.2 and gcc-3 compiler (which proved about 4 times faster than 2.95.x releases). Rebuilt the development environment.
  • Writting/rewiewing technical specs, documentation, white papers.
Style config