Ruslan R. Fayzrakhmanov
Projektassistent (Project assistant) Dr.techn. (PhD) Dipl.-Ing.(ViennaTU) M.Sc.(PermSTU)
Address |
Institute of Information Systems,
Database and Artificial Intelligence Group,
Vienna University of Technology,
Favoritenstraße 9-11,
1040 Wien, Austria
|
Mail |
|
Phone |
+43-1-58801-18448 |
Room |
HA 0306 |
Office Hours |
by appointment |
About me
I am a project assistant at the Institute of Information Systems,
Database and Artificial Intelligence group, working in the area of
Web Information Extraction and Web Accessibility under the supervision
of
Prof. Reinhard Pichler
and
Dr. Robert Baumgartner.
I hold my master's degree from
the
Information Technology and Automated Systems department of the Perm State Technical University (Russia) in 2008.
I received my Ph.D. in Computer Science from
Vienna University of Technology in January 2014.
My PhD study has been partially supported by the Erasmus Mundus Program of the European Union.
From January 2016 I am a
research assistant at the University of Oxford, Computer Science Department.
Research projects
Teaching
- Principle advisor of bachelor students and a co-supervisor of master students..
Lectures: Summer term 2014
Surveys
Publications
Selective publications
-
Models and approaches for web information extraction and web page understanding.
Ruslan R. Fayzrakhmanov.
The Evolution of the Internet in the Business Sector: Web 1.0 to Web 3.0, P. Isaias, P. Kommers, and T. Issa, Eds., chapter 2, pages 25-50. IGI Global, 2015.
[ bib ]
-
Web accessibility for the blind through visual representation analysis.
Ruslan R. Fayzrakhmanov.
PhD Thesis (Dissertation), Vienna University of Technology, December 2013.
[ bib |
@library ]
-
Web object identification for web automation and meta-search.
Iraklis Kordomatis,
Christoph Herzog,
Ruslan R. Fayzrakhmanov,
Bernhard Krüpl-Sypien,
Wolfgang Holzinger,
and Robert Baumgartner.
In Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics (WIMS 2013), Madrid, Spain, 12-14 June, 2013, Article No. 13. ACM, 2013.
[ bib |
ACM |
pdf ]
-
Web objects identification for web automation: Objects and their features.
Ruslan R. Fayzrakhmanov,
Christoph Herzog,
and Iraklis Kordomatis.
Technical report DBAI-TR-2013-80, Institute of Information Systems, TU Vienna, Vienna, 2013.
[ bib |
DBAI Technical Reports (pdf) ]
-
Feature-based object identification for web automation.
Christoph Herzog,
Iraklis Kordomatis,
Wolfgang Holzinger,
Ruslan R. Fayzrakhmanov,
and Bernhard Krüpl-Sypien.
In Proceedings of the 28th Annual ACM Symposium on Applied Computing (SAC 2013), Web Technologies Track, Coimbra, Portugal, 18-22 March, 2013, pages 742-749. ACM.
[ bib |
ACM ]
-
WPPS: A framework for web page processing.
Ruslan R. Fayzrakhmanov.
In Proceedings of the 13th International Conference on Web Information Systems Engineering (WISE 2012), Demo Session, Paphos, Cyprus, 28-30 November, 2012,
pages 800-803. Springer.
[ bib |
Springer ]
-
WPPS: A novel and comprehensive framework for web page understanding and information extraction.
Ruslan R. Fayzrakhmanov.
In Proceeding of the International Conference IADIS WWW/Internet, Madrid, 18-21 October,
pages 19-26, Madrid, 2012. IADIS Press.
[ bib ]
-
A versatile model for web page representation, information extraction and content re-packaging.
Bernhard Krüpl-Sypien,
Ruslan R. Fayzrakhmanov,
Wolfgang Holzinger,
Mathias Panzenböck,
and Robert Baumgartner.
In Proceedings of the 11th ACM Symposium on Document Engineering
(DocEng2011), Mountain View, USA, 19-22 September,
pages 129-138, New York, 2011. ACM.
[ bib |
ACM ]
-
Web 2.0 vision for the blind.
Robert Baumgartner,
Ruslan R. Fayzrakhmanov,
Wolfgang Holzinger,
Bernhard Krüpl,
Max C. Göbel,
David Klein,
and Rafael Gattringer.
In Proceedings of Web Science Conference 2010 (WebSci10), Raleigh, USA, 26-27 April, 2010,
page 1-8.
[ bib |
Web Science ]
-
Modelling web navigation with the user in mind.
Ruslan R. Fayzrakhmanov,
Max C. Göbel,
Wolfgang Holzinger,
Bernhard Krüpl,
Andreas Mager,
and Robert Baumgartner.
In Proceedings of the International Cross Disciplinary Conference on Web Ac-
cessibility (W4A'2010), Raleigh, USA, 26-27 April, 2010,
page 1-4, New York, 2010. ACM.
[ bib |
ACM ]
-
A unified ontology-based web page model for improving accessibility.
Ruslan R. Fayzrakhmanov,
Max C. Göbel,
Wolfgang Holzinger,
Bernhard Krüpl,
and Robert Baumgartner.
In Proceedings of the 19th International Conference on World Wide
Web (WWW'2010), Raleigh, USA, April 26-30, 2010,
pages 1087-1088, New York, 2010. ACM.
[ bib |
ACM ]
Reviewing
External reviewer for the following journals and conferences:
Software and models
- UOM
- A Unified Ontological Model formalizing some aspects of the web page conceptualization.
- WPPS
- A Web Page Processing System, a new, highly configurable Java-based framework for developing effective and robust methods that address problems in the fields of Web Page Understanding and Web Information Extraction.
- MANM
- A Multi-Axial Navigation Model, a model enhancing web page accessibility.
- Blindzilla
- A set of prototypes for accessible web page navigation.
- css-drawing-order-detection
- An algorithm for detecting painting order of css boxes and layers of the HTML web documents.
A painting order is computed according to the CSS 2.1 specification and factors in the particularities of Firefox 3.6 (XULRunner/Gecko 1.9.2).
- RegExpTokenizer
- It is an information extraction tool which extends expressiveness of regular expressions with additional constructs and requires a user to manually define a wrapper. Thus, the user can impose additional constraints for the length and value (string or numerical) of the returned string. It is also possible to define new concepts specified by the extended language as well as use them in the definition of other concepts.
The UOM, MANM, and the Blindzilla prototypes were developed based on the cooperative work with the team of the ABBA project.
Datasets
- ATW
- Annotated Transport Web Forms, a dataset for the basic web object identification problem.
The ATW dataset was developed by the cooperative work with the team of the TAMCROW project.
- WPPS-HTML-DS1 (zip)
- A dataset contains web pages of CNN and Amazon websites, some subset of the RISE corpus which is used for comparing and testing information extraction systems, and a collection of web forums provided by Big Boards (Web pages of web forums were collected by Wolfgang Holzinger.)
June 2016