Activity at WebSci
Last updated: Sun, 21 Jul 2002
I have designed and implemented these sub-projects from scratch:
- Simple SQL-based search engine. Tasks involved:
- A generic document parser and indexer.
- A search query interpreter (supports AND/OR operators, including or excluding words, and grouping with parentheses).
- An SQL generator for querying the index database.
- Enabling the Ensiva project to generate PDF pages from an HTML design.
Tasks involved in this project:
- Breaking paragraphs into lines -- I implemented a variation of Knuth's line-breaking algorithm in a generic library that is currently used by two different projects.
- Generic PDF generation library.
- Page layout -- breaking tables across pages, keeping a minimum number of rows together on a page, repeating headers on all pages, etc.
- Font handling library -- implemented an AFM parser for retrieving font information and metrics. This is currently being rewritten using FreeType.
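As an illustration of the paragraph-breaking task listed above, here is a minimal Python sketch of "total-fit" line breaking in the spirit of Knuth's algorithm. It is not the WebSci library code: the function name break_lines, the fixed-width character model, and the squared-slack cost are assumptions made for this example, whereas the full algorithm works with boxes, glue, and penalties.

```python
def break_lines(words, width):
    """Split words into lines of at most `width` characters.

    Simplified total-fit breaking: choose the breaks that minimize the
    sum of squared trailing space over all lines except the last one.
    (Hypothetical helper for illustration only.)
    """
    n = len(words)
    inf = float("inf")
    # best[i] = minimal cost of laying out words[i:]
    # nxt[i]  = index of the first word on the line after the one starting at i
    best = [inf] * (n + 1)
    nxt = [n] * (n + 1)
    best[n] = 0

    for i in range(n - 1, -1, -1):
        length = -1  # cancels the space counted before the first word
        for j in range(i, n):
            length += 1 + len(words[j])
            # Stop once the line overflows, but still let an over-long
            # single word occupy a line of its own.
            if length > width and j > i:
                break
            slack = max(width - length, 0)
            cost = 0 if j == n - 1 else slack ** 2  # last line is free
            total = cost + best[j + 1]
            if total < best[i]:
                best[i] = total
                nxt[i] = j + 1

    # Follow the recorded break points to build the lines.
    lines, i = [], 0
    while i < n:
        j = nxt[i]
        lines.append(" ".join(words[i:j]))
        i = j
    return lines
```

For example, break_lines("aaa bb cc ddddd".split(), 6) yields the lines "aaa", "bb cc", "ddddd", whereas a greedy first-fit approach would put "aaa bb" on the first line and leave "cc" alone on the second.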
Other activities include:
- Documenting code (my early days at WebSci).
- Profiling and performance improvements in some areas.
- Memory leak detection and fixing.
- Bug fixing, both in the areas that I developed myself and in areas developed by others.
- Team/project management.
- Participated in the PQA builder project (Web Clipping Applications for Palm-powered devices).
- Ported code to Red Hat 7.2 and the gcc-3 compiler (which proved about 4 times faster than the 2.95.x releases). Rebuilt the development environment.
- Writing/reviewing technical specs, documentation, and white papers.
