Profile
Philipp Schaer is Professor for Information Retrieval with the Institute of Information Science at TH Köln (University of Applied Sciences) where he teaches courses on Information Retrieval, Database Systems, and Search Engine Technology. He was team leader and postdoctoral researcher at the GESIS department Computational Social Science (CSS) where he led a team of computer, social and information scientist. His professional work was on topics like semi-/automatic indexing, semantic annotation, and using knowledge organization systems to enhance information retrieval.
He studied computer science with special interest in information retrieval and human factors in information systems and graduated at University of Koblenz-Landau where he received his degree as Diplom-Informatiker and later his doctorate in computer science. He has been working in DFG-funded research projects on human-computer interaction, open access repositories and value-added services for information retrieval and published in the areas of information retrieval, informetrics, and digital libraries. He serves as reviewer for journals, edited books, international conferences and workshops.
His research interests are: information retrieval, query expansion, applied informetric methods in digital libraries, evaluation of information retrieval systems, and especially living lab evaluation environments.
List of Publications
Validating Synthetic Usage Data in Living Lab Environments.
Journal of Data and Information Quality, 16(1):1-33, 2024.
Timo Breuer, Norbert Fuhr and Philipp Schaer.
[doi] [pdf]
[BibTeX]
ARTS: Assessing Readability & Text Simplicity.
In:
Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, November 12-16.
Association for Computational Linguistics, 2024.
Björn Engelmann, Christin Katharina Kreutz, Fabian Haak and Philipp Schaer.
[BibTeX]
Context-Driven Interactive Query Simulations Based on Generative Large Language Models.
In:
ECIR 2024.
2024.
Björn Engelmann, Timo Breuer, Jana Isabelle Friese, Philipp Schaer and Norbert Fuhr.
[pdf]
[BibTeX]
ChatGPT, schreibe mir einen Aufsatz über Ursula Georgy.
In:
S. Fühles-Ubach, A. Oßwald, F. Schade and R. Seidler-de Alwis, editors,
Engagement in der Informationswissenschaft - Festschrift für Ursula Georgy, pages 36-51.
b.i.t.verlag, Wiesbaden, 2024.
Claudia Frick and Philipp Schaer.
[pdf]
[BibTeX]
Teaching Information Retrieval with a Shared Task
Across Universities: First Steps and Findings.
In:
Proceedings of LWDA’24: Lernen, Wissen, Daten, Analysen. September 23–25, 2024, Würzburg, Germany, series CEUR Workshop Proceedings (CEUR-WS.org).
2024.
Maik Fröbe, Christopher Akiki, Timo Breuer, Thomas Eckart, Annemarie Friedrich, Lukas Gienapp, Jan Heinrich Merker, Martin Potthast, Harrisen Scells, Philipp Schaer and Benno Stein.
[pdf]
[BibTeX]
Investigating Bias in Political Search Query Suggestions by Relative Comparison with LLMs.
In:
WEBSCI '24: Proceedings of the 16th ACM Web Science Conference.
ACM, 2024.
Fabian Haak, Björn Engelmann, Christin Katharina Kreutz and Philipp Schaer.
[BibTeX]
Evaluation of Temporal Change in IR Test Collections.
In:
Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval, series ICTIR '24, pages 3–13.
Association for Computing Machinery, New York, NY, USA, 2024.
Jüri Keller, Timo Breuer and Philipp Schaer.
[doi] [pdf]
[abstract]
[BibTeX]
Information retrieval systems have been evaluated using the Cranfield paradigm for many years. This paradigm allows a systematic, fair, and reproducible evaluation of different retrieval methods in fixed experimental environments. However, real-world retrieval systems must cope with dynamic environments and temporal changes that affect the document collection, topical trends, and the individual user's perception of what is considered relevant. Yet, the temporal dimension in IR evaluations is still understudied.To this end, this work investigates how the temporal generalizability of effectiveness evaluations can be assessed. As a conceptual model, we generalize Cranfield-type experiments to the temporal context by classifying the change in the essential components according to the create, update, and delete operations of persistent storage known from CRUD. From the different types of change different evaluation scenarios are derived and it is outlined what they imply. Based on these scenarios, renowned state-of-the-art retrieval systems are tested and it is investigated how the retrieval effectiveness changes on different levels of granularity.We show that the proposed measures can be well adapted to describe the changes in the retrieval results. The experiments conducted confirm that the retrieval effectiveness strongly depends on the evaluation scenario investigated. We find that not only the average retrieval performance of single systems but also the relative system performance are strongly affected by the components that change and to what extent these components changed.
Leveraging Prior Relevance Signals in Web Search.
In: G. Faggioli, N. Ferro, P. Galuscáková and A. G. S. de Herrera, editors,
Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), Grenoble, France, 9-12 September, 2024, volume 3740, series CEUR Workshop Proceedings, pages 2396-2406.
CEUR-WS.org, 2024.
Jüri Keller, Timo Breuer and Philipp Schaer.
[doi]
[BibTeX]
Replicability Measures for Longitudinal Information Retrieval Evaluation.
In:
Experimental IR Meets Multilinguality, Multimodality, and Interaction - 15th International Conference of the CLEF Association, CLEF 2024, Grenoble, France, September 9–12, 2024, Proceedings, Part I.
Springer Cham, 2024.
Jüri Keller, Timo Breuer and Philipp Schaer.
[pdf]
[BibTeX]
BATS: BenchmArking Text Simplicity.
In: L.-W. Ku, A. Martins and V. Srikumar, editors,
Findings of the Association for Computational Linguistics ACL 2024, pages 11968-11989.
Association for Computational Linguistics, Bangkok, Thailand and virtual meeting, 2024.
Christin Kreutz, Fabian Haak, Björn Engelmann and Philipp Schaer.
[doi]
[abstract]
[BibTeX]
Evaluation of text simplification currently focuses on the difference of a source text to its simplified variant. Datasets for this evaluation base on a specific topic and group of readers for which is simplified. The broad applicability of text simplification and specifics that come with intended target audiences (e.g., children compared to adult non-experts) are disregarded. An explainable assessment of the overall simplicity of text is missing. This work is BenchmArking Text Simplicity (BATS): we provide an explainable method to assess practical and concrete rules from literature describing features of simplicity and complexity of text. Our experiments on 15 datasets for text simplification highlight differences in features that are important in different domains of text and for different intended target audiences.
Evaluating Stability of Information Needs.
In:
Proceedings of BIR@ECIR 2024.
CEUR, 2024.
Christin Katharina Kreutz, Philipp Schaer and Ralf Schenkel.
[doi] [pdf]
[BibTeX]
Editorial to the special issue on JCDL 2022.
International Journal on Digital Libraries, 2024.
Philipp Mayr, Annika Hinze and Philipp Schaer.
[doi]
[abstract]
[BibTeX]
This special issue features the selected works of authors who have presented papers at the 2022 iteration of the Joint Conference on Digital Libraries (JCDL) in Cologne, Germany. The motto of the conference was ``Bridging Worlds'' and was run as a fully hybrid event. Ten papers covering all aspects of Digital Libraries, namely Natural Language Processing, Information Retrieval, User Behavior, Scholarly Communication, Classification, Information Extraction are included in this issue.
Dynamics in Search Engine Query Suggestions for European Politicians.
In:
WEBSCI '24: Proceedings of the 16th ACM Web Science Conference, pages 279–289.
2024.
Franziska Pradel, Fabian Haak, Sven-Oliver Proksch and Philipp Schaer.
[doi]
[BibTeX]
SIGIR 2024 Workshop on Simulations for Information Access (Sim4IA 2024).
In:
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, series SIGIR '24, pages 3058–3061.
Association for Computing Machinery, New York, NY, USA, 2024.
Philipp Schaer, Christin Katharina Kreutz, Krisztian Balog, Timo Breuer and Norbert Fuhr.
[doi] [pdf]
[abstract]
[BibTeX]
Simulations in various forms have been used to evaluate information access systems, like search engines, recommender systems, or conversational agents. In the form of the Cranfield paradigm, a simulation setup is well-known in the IR community, but user simulations have recently gained interest. While user simulations help to reduce the complexity of evaluation experiments and help with reproducibility, they can also contribute to a better understanding of users. Building on recent developments in methods and toolkits, the Sim4IA workshop aims to bring together researchers and practitioners to form an interactive and engaging forum for discussions on the future perspectives of the field. An additional aim is to plan an upcoming TREC/CLEF campaign.
Bibliometric Data Fusion for Biomedical Information Retrieval.
In:
ACM/IEEE Joint Conference on Digital Libraries, JCDL 2023, Santa Fe, NM, USA, June 26-30, 2023, pages 107-118.
IEEE, 2023.
Timo Breuer, Christin Katharina Kreutz, Philipp Schaer and Dirk Tunger.
[doi] [pdf]
[BibTeX]
Reliable Rules for Relation Extraction in a Multimodal Setting.
In: B. König-Ries, S. Scherzinger, W. Lehner and G. Vossen, editors,
Datenbanksysteme für Business, Technologie und Web (BTW 2023), 20. Fachtagung des GI-Fachbereichs ,,Datenbanken und Informationssysteme" (DBIS), 06.-10, März 2023, Dresden, Germany, Proceedings, volume P-331, series LNI, pages 1009-1021.
Gesellschaft für Informatik e.V., 2023.
Björn Engelmann and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Simulating Users in Interactive Web Table Retrieval.
In: I. Frommholz, F. Hopfgartner, M. Lee, M. Oakes, M. Lalmas, M. Zhang and R. L. T. Santos, editors,
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom, October 21-25, 2023, pages 3875-3879.
ACM, 2023.
Björn Engelmann, Timo Breuer and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Text Simplification of Scientific Texts for Non-Expert Readers.
In:
SimpleText@CLEF-2023, volume abs/2307.03569, series CEUR Workshop Proceedings.
2023.
Björn Engelmann, Fabian Haak, Christin Katharina Kreutz, Narjes Nikzad-Khasmakhi and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Qbias - A Dataset on Media Bias in Search Queries and Query Suggestions.
In:
Proceedings of the 15th ACM Web Science Conference 2023, series WebSci '23, pages 239–244.
Association for Computing Machinery, New York, NY, USA, 2023.
Fabian Haak and Philipp Schaer.
[doi] [pdf]
[abstract]
[BibTeX]
This publication describes the motivation and generation of Qbias, a large dataset of Google and Bing search queries, a scraping tool and dataset for biased news articles, as well as language models for the investigation of bias in online search. Web search engines are a major factor and trusted source in information search, especially in the political domain. However, biased information can influence opinion formation and lead to biased opinions. To interact with search engines, users formulate search queries and interact with search query suggestions provided by the search engines. A lack of datasets on search queries inhibits research on the subject. We use Qbias to evaluate different approaches to fine-tuning transformer-based language models with the goal of producing models capable of biasing text with left and right political stance. Additionally to this work we provided datasets and language models for biasing texts that allow further research on bias in online information search.
Automated Statement Extraction from Press Briefings.
In: B. König-Ries, S. Scherzinger, W. Lehner and G. Vossen, editors,
Datenbanksysteme für Business, Technologie und Web (BTW 2023), 20. Fachtagung des GI-Fachbereichs ,,Datenbanken und Informationssysteme" (DBIS), 06.-10, März 2023, Dresden, Germany, Proceedings, volume P-331, series LNI, pages 1049-1057.
Gesellschaft für Informatik e.V., 2023.
Jüri Keller, Meik Bittkowski and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Evaluating Temporal Persistence Using Replicability Measures.
In: M. Aliannejadi, G. Faggioli, N. Ferro and M. Vlachos, editors,
Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), Thessaloniki, Greece, September 18th to 21st, 2023, volume 3497, series CEUR Workshop Proceedings, pages 2441-2457.
CEUR-WS.org, 2023.
Jüri Keller, Timo Breuer and Philipp Schaer.
[doi]
[BibTeX]
Capturing Stability of Information Needs in Digital Libraries.
In:
ACM/IEEE Joint Conference on Digital Libraries, JCDL 2023, Santa Fe, NM, USA, June 26-30, 2023, pages 276-278.
IEEE, 2023.
Christin Katharina Kreutz, Philipp Schaer and Ralf Schenkel.
[doi] [pdf]
[BibTeX]
Evaluating Digital Library Search Systems by Using Formal Process
Modelling.
In:
ACM/IEEE Joint Conference on Digital Libraries, JCDL 2023, Santa Fe, NM, USA, June 26-30, 2023, pages 1-12.
IEEE, 2023.
Christin Katharina Kreutz, Martin Blum, Philipp Schaer, Ralf Schenkel and Benjamin Weyers.
[doi] [pdf]
[BibTeX]
An in-depth investigation on the behavior of measures to quantify reproducibility.
Information Processing and Management, 60(3):103332, 2023.
Maria Maistro, Timo Breuer, Philipp Schaer and Nicola Ferro.
[doi] [pdf]
[BibTeX]
ConvGenVisMo: Evaluation of Conversational Generative Vision Models.
In:
ICML 2023 Workshop Artificial Intelligence and Human-Computer Interaction.
2023.
Narjes Nikzad-Khasmakhi, Meysam Asgari-Chenaghlu, Nabiha Asghar, Philipp Schaer and Dietlind Zühlke.
[doi] [pdf]
[BibTeX]
Preliminary Results of a Scientometric Analysis of the German Information Retrieval Community 2020-2023.
In: M. Leyer and J. Wichmann, editors,
Proceedings of the LWDA 2023 Workshops: BIA, DB, IR, KDML and WM. Marburg, Germany, 09.-11. October 2023, volume 3630, series CEUR Workshop Proceedings, pages 222-230.
CEUR-WS.org, 2023.
Philipp Schaer, Svetlana Myshkina and Jüri Keller.
[doi] [pdf]
[BibTeX]
Sprachmodelle und neuronale Netze im Information Retrieval.
In:
R. Kuhlen, D. Lewandowski, W. Semar and C. Womser-Hacker, editors,
Grundlagen der Informationswissenschaft, chapter C 9, pages 455-466.
De Gruyter Saur, Berlin, Boston, 2023.
Philipp Schaer.
[doi] [pdf]
[BibTeX]
Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries.
JCDL '22.
ACM, New York, NY, USA, 2022.
OCLC: 1355716202.
Akiko Aizawa, Thomas Mandl, Zeljko Careciv, Annika Hinze, Philipp Mayr and Philipp Schaer.
[pdf]
[BibTeX]
irmetadata: An Extensible Metadata Schema for IR Experiments.
In:
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 3078-3089.
ACM, 2022.
Timo Breuer, Jüri Keller and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Relevance assessments, bibliometrics, and altmetrics: a quantitative study on PubMed and arXiv.
Scientometrics, 127(5):2455-2478, 2022.
Timo Breuer, Philipp Schaer and Dirk Tunger.
[doi] [pdf]
[BibTeX]
Validating Simulations of User Query Variants.
In: M. Hagen, S. Verberne, C. Macdonald, C. Seifert, K. Balog, K. Nørvåg and V. Setty, editors,
Advances in Information Retrieval - 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10-14, 2022, Proceedings, Part I, volume 13185, series Lecture Notes in Computer Science, pages 80-94.
Springer, 2022.
Timo Breuer, Norbert Fuhr and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Auditing Search Query Suggestion Bias Through Recursive Algorithm Interrogation.
In:
14th ACM Web Science Conference 2022, pages 219–227.
ACM, 2022.
Fabian Haak and Philipp Schaer.
[doi]
[BibTeX]
A Living Lab Architecture for Reproducible Shared Task Experimentation.
In:
Information between Data and Knowledge, pages 348-362.
Werner Hülsbusch, Glückstadt, 2021.
Session 6: Emerging Technologies
Timo Breuer and Philipp Schaer.
[pdf]
[abstract]
[BibTeX]
No existing evaluation infrastructure for shared tasks currently supports both reproducible on- and offline experiments. In this work, we present an architecture that ties together both types of experiments with a focus on reproducibility. The readers are provided with a technical description of the infrastructure and details of how to contribute their own experiments to upcoming evaluation tasks.
Evaluating Elements of Web-based Data Enrichment for Pseudo-Relevance Feedback Retrieval.
In:
Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Twelfth International Conference of the CLEF Association (CLEF 2021), series Lecture Notes in Computer Science.
Springer Nature, 2021.
Timo Breuer, Melanie Pest and Philipp Schaer.
[pdf]
[BibTeX]
reproeval: A Python Interface to Reproducibility Measures of System-Oriented IR Experiments.
In: D. Hiemstra, M. Moens, J. Mothe, R. Perego, M. Potthast and F. Sebastiani, editors,
Advances in Information Retrieval - 43rd European Conference on IR Research, ECIR 2021, Virtual Event, March 28 - April 1, 2021, Proceedings, Part II, volume 12657, series Lecture Notes in Computer Science, pages 481-486.
Springer, 2021.
Timo Breuer, Nicola Ferro, Maria Maistro and Philipp Schaer.
[doi] [pdf]
[BibTeX]
IRCologne at TREC 2021 News Track - Relation-based re-ranking for background linking.
In:
TREC.
National Institute of Standards and Technology (NIST), 2021.
Björn Engelmann and Philipp Schaer.
[pdf]
[BibTeX]
Perception-Aware Bias Detection for Query Suggestions.
In: L. Boratto, S. Faralli, M. Marras and G. Stilo, editors,
Advances in Bias and Fairness in Information Retrieval - Second International Workshop on Algorithmic Bias in Search and Recommendation, BIAS 2021, Lucca, Italy, April 1, 2021, Proceedings, volume 1418, series Communications in Computer and Information Science.
Springer Nature, Switzerland, 2021.
Fabian Haak and Philipp Schaer.
[pdf]
[BibTeX]
Living Lab Evaluation for Life and Social Sciences Search Platforms - LiLAS at CLEF 2021.
In: D. Hiemstra, M. Moens, J. Mothe, R. Perego, M. Potthast and F. Sebastiani, editors,
Advances in Information Retrieval - 43rd European Conference on IR Research, ECIR 2021, Virtual Event, March 28 - April 1, 2021, Proceedings, Part II, volume 12657, series Lecture Notes in Computer Science, pages 657-664.
Springer, 2021.
Philipp Schaer, Johann Schaible and Leyla Jael Garca Castro.
[doi] [pdf]
[BibTeX]
Overview of LiLAS 2021 - Living Labs for Academic Search.
In: K. S. Candan, B. Ionescu, L. Goeuriot, B. Larsen, H. Müller, A. Joly, M. Maistro, F. Piroi, G. Faggioli and N. Ferro, editors,
Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Twelfth International Conference of the CLEF Association (CLEF 2021), volume 12880, series Lecture Notes in Computer Science.
2021.
Philipp Schaer, Timo Breuer, Leyla Jael Castro, Benjamin Wolff, Johann Schaible and Narges Tavakolpoursaleh.
[pdf]
[BibTeX]
Overview of LiLAS 2021 - Living Labs for Academic Search (Extended Overview).
In: G. Faggioli, N. Ferro, A. Joly, M. Maistro and F. Piroi, editors,
Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, series CEUR Workshop Proceedings.
2021.
Philipp Schaer, Timo Breuer, Leyla Jael Castro, Benjamin Wolff, Johann Schaible and Narges Tavakolpoursaleh.
[pdf]
[BibTeX]
How to Measure the Reproducibility of System-oriented IR Experiments.
In: J. Huang, Y. Chang, X. Cheng, J. Kamps, V. Murdock, J.-R. Wen and Y. Liu, editors,
SIGIR, pages 349-358.
ACM, 2020.
Timo Breuer, Nicola Ferro, Norbert Fuhr, Maria Maistro, Tetsuya Sakai, Philipp Schaer and Ian Soboroff.
[doi] [pdf]
[BibTeX]
Relations Between Relevance Assessments, Bibliometrics and Altmetrics.
In:
Proceedings of the 10th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 42nd European Conference on Information Retrieval, BIR@ECIR 2020, Lisbon, Portugal, April 14th, 2020 [online only], pages 101-112.
2020.
Timo Breuer, Philipp Schaer and Dirk Tunger.
[doi] [pdf]
[BibTeX]
Conference Indexing in Digital Libraries: A Ranking Model and Case
Study on dblp.
In:
Proceedings of the 10th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 42nd European Conference on Information Retrieval, BIR@ECIR 2020, Lisbon, Portugal, April 14th, 2020 [online only], pages 30-41.
2020.
Christopher Michels, Mandy Neumann, Philipp Schaer and Ralf Schenkel.
[doi] [pdf]
[BibTeX]
Editorial.
Datenbank-Spektrum, 20(1):1-3, 2020.
Philipp Schaer, Klaus Berberich and Theo Härder.
[doi] [pdf]
[BibTeX]
Licht und Schatten bei Online-Experimenten – Living Labs aus Sicht eines Forschenden.
In:
C. Kaminsky, U. Seelmeyer, S. Siebert and P. Werner, editors,
Digitale Technologien zwischen Lenkung und Selbstermächtigung.
Beltz Juventa, Weinheim Basel, 2020.
Philipp Schaer.
[pdf]
[BibTeX]
Living Labs for Academic Search at CLEF 2020.
In: J. M. Jose, E. Yilmaz, J. Magalhães, P. Castells, N. Ferro, Má. J. Silva and F. Martins, editors,
Advances in Information Retrieval, pages 580-586.
Springer International Publishing, Cham, 2020.
Philipp Schaer, Johann Schaible and Bernd Müller.
[pdf]
[abstract]
[BibTeX]
The need for innovation in the field of academic search and IR, in general, is shown by the stagnating system performance in controlled evaluation campaigns, as demonstrated in TREC and CLEF meta-evaluation studies, as well as user studies in real systems of scientific information and digital libraries. The question of what constitutes relevance in academic search is multi-layered and a topic that drives research communities for years. The Living Labs for Academic Search (LiLAS) workshop has the goal to inspire the discussion on research and evaluation of academic search systems by strengthening the concept of living labs to the domain of academic search. We want to bring together IR researchers interested in online evaluations of academic search systems and foster knowledge on improving the search for academic resources like literature, research data, and the interlinking between these resources. The employed online evaluation approach based on a living lab infrastructure allows the direct connection to real-world academic search systems from the life sciences and the social sciences.
Overview of LiLAS 2020 - Living Labs for Academic Search.
In: A. Arampatzis, E. Kanoulas, T. Tsikrika, S. Vrochidis, H. Joho, C. Lioma, C. Eickhoff, A. Névéol, L. Cappellato and N. Ferro, editors,
CLEF, volume 12260, series Lecture Notes in Computer Science, pages 364-371.
Springer, 2020.
Philipp Schaer, Johann Schaible and Leyla Jael García Castro.
[doi] [pdf]
[BibTeX]
Overview of LiLAS 2020 - Living Labs for Academic Search Workshop Lab (extended abstract).
In: L. Cappellato, C. Eickhoff, N. Ferro and A. Névéol, editors,
CLEF (Working Notes), volume 2696, series CEUR Workshop Proceedings.
CEUR-WS.org, 2020.
Philipp Schaer, Johann Schaible and Leyla Jael García-Castro.
[doi] [pdf]
[BibTeX]
Evaluation Infrastructures for Academic Shared Tasks.
Datenbank-Spektrum, 20(1):29-36, 2020.
Johann Schaible, Timo Breuer, Narges Tavakolpoursaleh, Bernd Müller, Benjamin Wolff and Philipp Schaer.
[doi] [pdf]
[abstract]
[BibTeX]
Academic search systems aid users in finding information covering specific topics of scientific interest and have evolved from early catalog-based library systems to modern web-scale systems. However, evaluating the performance of the underlying retrieval approaches remains a challenge. An increasing amount of requirements for producing accurate retrieval results have to be considered, e.g., close integration of the system's users. Due to these requirements, small to mid-size academic search systems cannot evaluate their retrieval system in-house. Evaluation infrastructures for shared tasks alleviate this situation. They allow researchers to experiment with retrieval approaches in specific search and recommendation scenarios without building their own infrastructure. In this paper, we elaborate on the benefits and shortcomings of four state-of-the-art evaluation infrastructures on search and recommendation tasks concerning the following requirements: support for online and offline evaluations, domain specificity of shared tasks, and reproducibility of experiments and results. In addition, we introduce an evaluation infrastructure concept design aiming at reducing the shortcomings in shared tasks for search and recommender systems.
Information Extraction for Semi-structured Email Corpora.
In: R. Jäschke and M. Weidlich, editors,
LWDA, volume 2454, series CEUR Workshop Proceedings, pages 322-330.
CEUR-WS.org, 2019.
Hendrik Adam and Philipp Schaer.
[pdf]
[BibTeX]
An investigation of biases in web search engine query suggestions.
Online Information Review, 44(2):365-381, 2019.
Malte Bonart, Anastasiia Samokhina, Gernot Heisenberg and Philipp Schaer.
[doi] [pdf]
[abstract]
[BibTeX]
Purpose
Survey-based studies suggest that search engines are trusted more than social media or even traditional news, although cases of false information or defamation are known. The purpose of this paper is to analyze query suggestion features of three search engines to see if these features introduce some bias into the query and search process that might compromise this trust. The authors test the approach on person-related search suggestions by querying the names of politicians from the German Bundestag before the German federal election of 2017.
Design/methodology/approach
This study introduces a framework to systematically examine and automatically analyze the varieties in different query suggestions for person names offered by major search engines. To test the framework, the authors collected data from the Google, Bing and DuckDuckGo query suggestion APIs over a period of four months for 629 different names of German politicians. The suggestions were clustered and statistically analyzed with regards to different biases, like gender, party or age and with regards to the stability of the suggestions over time.
Findings
By using the framework, the authors located three semantic clusters within the data set: suggestions related to politics and economics, location information and personal and other miscellaneous topics. Among other effects, the results of the analysis show a small bias in the form that male politicians receive slightly fewer suggestions on “personal and misc” topics. The stability analysis of the suggested terms over time shows that some suggestions are prevalent most of the time, while other suggestions fluctuate more often.
Originality/value
This study proposes a novel framework to automatically identify biases in web search engine query suggestions for person-related searches. Applying this framework on a set of person-related query suggestions shows first insights into the influence search engines can have on the query process of users that seek out information on politicians.
Dockerizing Automatic Routing Runs for The Open-Source IR Replicability Challenge (OSIRRC 2019).
In: R. Clancy, N. Ferro, C. Hauff, J. Lin, T. Sakai and Z. Z. Wu, editors,
OSIRRC@SIGIR, volume 2409, series CEUR Workshop Proceedings, pages 31-35.
CEUR-WS.org, 2019.
Timo Breuer and Philipp Schaer.
[pdf]
[BibTeX]
Replicability and Reproducibility of Automatic Routing Runs.
In: L. Cappellato, N. Ferro, D. E. Losada and H. Müller, editors,
CLEF (Working Notes), volume 2380, series CEUR Workshop Proceedings.
CEUR-WS.org, 2019.
Timo Breuer and Philipp Schaer.
[pdf]
[BibTeX]
STELLA: Towards a Framework for the Reproducibility of Online Search Experiments.
In: R. Clancy, N. Ferro, C. Hauff, J. Lin, T. Sakai and Z. Z. Wu, editors,
OSIRRC@SIGIR, volume 2409, series CEUR Workshop Proceedings, pages 8-11.
CEUR-WS.org, 2019.
Timo Breuer, Philipp Schaer, Narges Tavakolpoursaleh, Johann Schaible, Benjamin Wolff and Bernd Müller.
[pdf]
[BibTeX]
Data Librarian – ein neuer Studienschwerpunkt für wissenschaftliche Bibliotheken und Forschungseinrichtungen.
Bibliothek Forschung und Praxis, 43(2018/04):255–261, 2019.
Simone Fühles-Ubach, Philipp Schaer, Klaus Lepsky and Ragna Seidler-de Alwis.
[pdf]
[BibTeX]
Computational Methods in Professional Communication.
In:
ProComm, pages 275-285.
IEEE, 2019.
André Calero Valdez, Lena Adam, Dennis Assenmacher, Laura Burbach, Malte Bonart, Lena Frischlich and Philipp Schaer.
[pdf]
[BibTeX]
Intertemporal Connections Between Query Suggestions and Search Engine Results for Politics Related Queries.
In:
EuroCSS 2018 Dataset Challenge.
Cologne, 2018.
Malte Bonart and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Towards an IR Test Collection for the German National Library.
In: R. Gemulla, S. P. Ponzetto, C. Bizer, M. Keuper and H. Stuckenschmidt, editors,
LWDA, volume 2191, series CEUR Workshop Proceedings, pages 275-280.
CEUR-WS.org, 2018.
Johanna Munkelt, Philipp Schaer and Klaus Lepsky.
[doi] [pdf]
[BibTeX]
Prioritizing and Scheduling Conferences for Metadata Harvesting in dblp.
In:
JCDL '18 Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries , pages 45-48.
ACM, New York, NY, USA, 2018.
Mandy Neumann, Christopher Michels, Philipp Schaer and Schenkel Ralf.
[doi] [pdf]
[abstract]
[BibTeX]
Maintaining literature databases and online bibliographies is a core responsibility of metadata aggregators such as digital libraries. In the process of monitoring all the available data sources the question arises which data source should be prioritized. Based on a broad definition of information quality we are looking for different ways to find the best fitting and most promising conference candidates to harvest next. We evaluate different conference ranking features by using a pseudo-relevance assessment and a component-based evaluation of our approach.
Overview of TREC OpenSearch 2017.
In: E. M. Voorhees and A. Ellis, editors,
TREC, volume Special Publication 500-324.
National Institute of Standards and Technology (NIST), 2017.
Rolf Jagerman, Krisztian Balog, Philipp Schaer, Johann Schaible, Narges Tavakolpoursaleh and Maarten de Rijke.
[doi] [pdf]
[BibTeX]
Web-Scraping for Non-Programmers: Introducing OXPath for Digital Library Metadata Harvesting.
Code4Lib Journal, 38, 2017.
Mandy Neumann, Jan Steinberg and Philipp Schaer.
[doi] [pdf]
[abstract]
[BibTeX]
Building up new collections for digital libraries is a demanding task. Available data sets have to be extracted which is usually done with the help of software developers as it involves custom data handlers or conversion scripts. In cases where the desired data is only available on the data provider’s website custom web scrapers are needed. This may be the case for small to medium-size publishers, research institutes or funding agencies. As data curation is a typical task that is done by people with a library and information science background, these people are usually proficient with XML technologies but are not full-stack programmers. Therefore we would like to present a web scraping tool that does not demand the digital library curators to program custom web scrapers from scratch. We present the open-source tool OXPath, an extension of XPath, that allows the user to define data to be extracted from websites in a declarative way. By taking one of our own use cases as an example, we guide you in more detail through the process of creating an OXPath wrapper for metadata harvesting. We also point out some practical things to consider when creating a web scraper (with OXPath). On top of that, we also present a syntax highlighting plugin for the popular text editor Atom that we developed to further support OXPath users and to simplify the authoring process.
Enriching Existing Test Collections with OXPath.
In: G. J. F. Jones, S. Lawless, J. Gonzalo, L. Kelly, L. Goeuriot, T. Mandl, L. Cappellato and F. Nicola, editors,
Experimental IR Meets Multilinguality, Multimodality, and Interaction 8th International Conference of the CLEF Association, CLEF 2017, Dublin, Ireland, September 11-14, 2017, Proceedings, volume 10456, series Lecture Notes in Computer Science.
2017.
Philipp Schaer and Mandy Neumann.
[doi] [pdf]
[abstract]
[BibTeX]
Extending TREC-style test collections by incorporating external resources is
a time consuming and challenging task. Making use of freely available web data
requires technical skills to work with APIs or to create a web scraping program
specifically tailored to the task at hand. We present a light-weight
alternative that employs the web data extraction language OXPath to harvest
data to be added to an existing test collection from web resources. We
demonstrate this by creating an extended version of GIRT4 called GIRT4-XT with
additional metadata fields harvested via OXPath from the social sciences portal
Sowiport. This allows the re-use of this collection for other evaluation
purposes like bibliometrics-enhanced retrieval. The demonstrated method can be
applied to a variety of similar scenarios and is not limited to extending
existing collections but can also be used to create completely new ones with
little effort.
Living Labs - An Ethical Challenge for Researchers and Platform Providers.
In:
M. Zimmer and K. Kinder-Kurlanda, editors,
Internet Research Ethics for the Social Age: New Challenges, Cases, and Contexts.
Peter Lang, 2017.
Philipp Schaer.
[pdf]
[BibTeX]
IR-Cologne at TREC 2017 OpenSearch Track: Rerunning Popularity Ranking Experiments in a Living Lab.
In:
The Twenty-Sixth Text REtrieval Conference Proceedings (TREC 2017) , series NIST Special Publication.
National Institute of Standards and Technology (NIST), 2017.
Narges Tavakolpoursaleh, Mandy Neumann and Philipp Schaer.
[pdf]
[BibTeX]
Overview of the TREC 2016 Open Search track.
In: E. M. Voorhees and A. Ellis, editors,
TREC, volume Special Publication 500-321.
National Institute of Standards and Technology (NIST), 2016.
Krisztian Balog, Anne Schuth, Peter Dekker, Philipp Schaer, Po-Yu Chuang and Narges Tavakolpoursaleh.
[pdf]
[BibTeX]
How Relevant is the Long Tail? - A Relevance Assessment Study on Million Short.
In: N. Fuhr, P. Quaresma, T. Gonçalves, B. Larsen, K. Balog, C. Macdonald, L. Cappellato and N. Ferro, editors,
Experimental IR Meets Multilinguality, Multimodality, and Interaction - 7th International Conference of the CLEF Association, CLEF 2016, Évora, Portugal, September 5-8, 2016, Proceedings, volume 9822, series Lecture Notes in Computer Science, pages 227-233.
Springer, 2016.
Philipp Schaer, Philipp Mayr, Sebastian Sünkler and Dirk Lewandowski.
[pdf]
[BibTeX]
Ideas for a Standard LL4IR Extension - Living Labs from a System Operator's Perspective.
In: K. Balog, L. Cappellato, N. Ferro and C. Macdonald, editors,
Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Évora, Portugal, 5-8 September, 2016, volume 1609, series CEUR Workshop Proceedings, pages 591-592.
CEUR-WS.org, 2016.
Philipp Schaer and Narges Tavakolpoursaleh.
[pdf]
[BibTeX]
Popularity Ranking for Scientific Literature Using the Characteristic Scores and Scale Method.
In: E. M. Voorhees and A. Ellis, editors,
TREC, volume Special Publication 500-321.
National Institute of Standards and Technology (NIST), 2016.
Philipp Schaer and Narges Tavakolpoursaleh.
[doi] [pdf]
[BibTeX]
Query Expansion for Survey Question Retrieval in the Social Sciences.
In:
Proceedings of 19th International Conference on Theory and Practice of Digital Libraries 2015 (TPDL 2015).
Springer, 2015.
Nadine Dulisch, Andreas Oskar Kempf and Philipp Schaer.
[pdf]
[BibTeX]
A System for Probabilistic Linking of Thesauri and Classification Systems.
KI - Künstliche Intelligenz:1-4, 2015.
Lisa Posch, Philipp Schaer, Arnim Bleier and Markus Strohmaier.
[doi] [pdf]
[BibTeX]
The Polylingual Labeled Topic Model .
In: S. Hölldobler, M. Krötzsch, R. Peñaloza and S. Rudolph, editors,
KI 2015: Advances in Artificial Intelligence, volume 9324, series Lecture Notes in Computer Science, pages 295-301.
Springer, 2015.
Lisa Posch, Arnim Bleier, Philipp Schaer and Markus Strohmaier.
[pdf]
[BibTeX]
Historical Clicks for Product Search: GESIS at CLEF LL4IR 2015.
In:
CLEF 2015 Workshop Proceedings.
2015.
Philipp Schaer and Narges Tavakolpoursaleh.
[pdf]
[BibTeX]
On the Connection Between Citation-based and Topical Relevance Ranking: Results of a Pretest using iSearch.
In:
Proceedings of the First Workshop on Bibliometric-enhanced Information Retrieval, volume 1145, series CEUR Workshop Proceedings, pages 37-44.
Amsterdam, The Netherlands, 2014.
urn:nbn:de:0074-1143-7
Zeljko Carevic and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Bibliometric-Enhanced Information Retrieval.
In:
Proceedings of the 36th International European Conference on Information Retrieval, ECIR'14, April 13 - April 16, 2014, Amsterdam, The Netherlands, volume 8416, series Lecture Note in Computer Science, pages 798–801.
Springer, 2014.
Philipp Mayr, Andrea Scharnhorst, Birger Larsen, Philipp Schaer and Peter Mutschke.
[pdf]
[abstract]
[BibTeX]
Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship network, can improve re- trieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics / scientometrics and to create a com- mon ground for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface.
Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014.
In:
Proceedings of the First Workshop on Bibliometric-enhanced Information Retrieval, volume 1145, series CEUR Workshop Proceedings, pages 37-44.
Amsterdam, The Netherlands, 2014.
urn:nbn:de:0074-1143-7
Philipp Mayr, Philipp Schaer, Andrea Scharnhorst and Peter Mutschke.
[doi] [pdf]
[BibTeX]
Performing Informetric Analysis on Information Retrieval Test Collections: Preliminary Experiments in the Physics Domain.
In:
14th International Society of Scientometrics and Informetrics Conference ISSI, volume 2, pages 1392-1400.
2013.
Tamara Heck and Philipp Schaer.
[pdf]
[abstract]
[BibTeX]
The combination of informetric analysis and information retrieval allows a twofold application. (1) While in-formetrics analysis is primarily used to gain insights into a scientific domain, it can be used to build recommen-dation or alternative ranking services. They are usually based on methods like co-occurrence or citation analyses. (2) Information retrieval and its decades-long tradition of rigorous evaluation using standard document corpora, predefined topics and relevance judgements can be used as a test bed for informetric analyses. We show a preliminary experiment on how both domains can be connected using the iSearch test collection, a standard information retrieval test collection derived from the open access arXiv.org preprint server. In this paper the aim is to draw a conclusion about the appropriateness of iSearch as a test bed for the evaluation of a retrieval or recommendation system that applies informetric methods to improve retrieval results for the user. Based on an interview study with physicists, bibliographic coupling and author-co-citation analysis, important authors for ten different research questions are identified. The results show that the analysed corpus includes these authors and their corresponding documents. This study is a first step towards a combination of retrieval evaluations and the evaluation of informetric analyses methods.
A Framework for Specific Term Recommendation Systems.
In:
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, pages 1093–1094.
ACM, New York, NY, USA, 2013.
Thomas Lüke, Philipp Schaer and Philipp Mayr.
[pdf]
[BibTeX]
An OAI-PMH-based Web-Service for the Generation of Co-Author-Networks.
In:
Proceedings of the International Symposium of Information Science (ISI 2013).
2013.
Philipp Schaer, Thomas Lüke, Philipp Mayr and Peter Mutschke.
[pdf]
[BibTeX]
Applied Informetrics for Digital Libraries: An Overview of Foundations, Problems and Current Approaches.
Historical Social Research, 38(3):267-281, 2013.
Philipp Schaer.
[pdf]
[BibTeX]
Der Nutzen informetrischer Analysen und nicht-textueller Dokumentattribute für das Information Retrieval in digitalen Bibliotheken.
PhD thesis, University Koblenz-Landau, Koblenz, 2013.
Philipp Schaer.
[doi] [pdf]
[BibTeX]
Information Retrieval und Informetrie: Zur Anwendung informetrischer Methoden in digitalen Bibliotheken.
Historical Social Research, 38(3):282-354, 2013.
Philipp Schaer.
[pdf]
[BibTeX]
Integrating Interactive Visualizations in the Search Process of Digital Libraries and IR Systems..
In: R. A. Baeza-Yates, A. P. de Vries, H. Zaragoza, B. B. Cambazoglu, V. Murdock, R. Lempel and F. Silvestri, editors,
ECIR, volume 7224, series Lecture Notes in Computer Science, pages 447-450.
Springer, 2012.
Daniel Hienert, Frank Sawitzki, Philipp Schaer and Philipp Mayr.
[doi] [pdf]
[BibTeX]
Improving Retrieval Results with Discipline-Specific Query Expansion..
In: P. Zaphiris, G. Buchanan, E. Rasmussen and F. Loizides, editors,
TPDL, volume 7489, series Lecture Notes in Computer Science, pages 408-413.
Springer, 2012.
Thomas Lüke, Philipp Schaer and Philipp Mayr.
[doi]
[BibTeX]
Extending Aggregated Search in a Social Sciences Digital Library.
In: B. Larsen, C. Lioma and A. De Vries, editors,
Proceedings of the Task Based and Aggregated Search Workshop (TBAS 2012).
2012.
Frank Sawitzki, Philipp Schaer and Daniel Hienert.
[pdf]
[BibTeX]
Better Than Their Reputation? On the Reliability of Relevance Assessments with Students.
In: T. Catarci, P. Forner, D. Hiemstra, A. Peñas and G. Santucci, editors,
Information Access Evaluation meets Multilinguality, Multimodality, and Visual Analytics Third International Conference of the CLEF Initiative - CLEF 2012, series LNCS, pages 126–137.
Springer-Verlag, Berlin, Heidelberg, 2012.
Philipp Schaer.
[pdf]
[abstract]
[BibTeX]
During the last three years we conducted several information retrieval evaluation series with more than 180 LIS students who made relevance assessments on the outcomes of three specific retrieval services. In this study we do not focus on the retrieval performance of our system but on the relevance assessments and the inter-assessor reliability. To quantify the agreement we apply Fleiss’ Kappa and Krippendorff’s Alpha. When we compare these two statistical measures on average Kappa values were 0.37 and Alpha values 0.15. We use the two agreement measures to drop too unreliable assessments from our data set. When computing the differences between the unfiltered and the filtered data set we see a root mean square error between 0.02 and 0.12. We see this as a clear indicator that disagreement affects the reliability of retrieval evaluations. We suggest not to work with unfiltered results or to clearly document the disagreement rates.
Building Custom Term Suggestion Web Services with OAI-Harvested Open Data.
In: M. Ockenfeld, I. Peters and K. Weller, editors,
Proceedings of the 64. DGI Annual Meeting and 2nd DGI-Conference, series Tagungen der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis, pages 389-396.
Deutsche Gesellschaft für Informationswissenschaft und Informationspraxis, 2012.
Philipp Schaer, Thomas Lüke and Wilko van Hoek.
[pdf]
[BibTeX]
Dealing with Sparse Document and Topic Representations: Lab Report for CHiC 2012.
In:
CLEF 2012 Labs and Workshop, Notebook Papers: CLEF/CHiC Workshop-Notes.
2012.
Philipp Schaer, Daniel Hienert, Frank Sawitzki, Andias Wira-Alam and Thomas Lüke.
[pdf]
[BibTeX]
Extending Term Suggestion with Author Names..
In: P. Zaphiris, G. Buchanan, E. Rasmussen and F. Loizides, editors,
TPDL, volume 7489, series Lecture Notes in Computer Science, pages 317-322.
Springer, 2012.
Philipp Schaer, Philipp Mayr and Thomas Lüke.
[doi]
[BibTeX]
A Novel Combined Term Suggestion Service for Domain-Specific Digital Libraries..
In: S. Gradmann, F. Borri, C. Meghini and H. Schuldt, editors,
TPDL, volume 6966, series Lecture Notes in Computer Science, pages 192-203.
Springer, 2011.
Daniel Hienert, Philipp Schaer, Johann Schaible and Philipp Mayr.
[doi] [pdf]
[BibTeX]
VIZGR: Combining Data on a Visual Level.
In:
Proceedings of the 7th International Conference on Web Information Systems and Technologies (WEBIST).
2011.
Daniel Hienert, Benjamin Zapilko, Philipp Schaer and Brigitte Mathiak.
[doi] [pdf]
[abstract]
[BibTeX]
In this paper we present a novel method to connect data on the visualization level. In general, visualizations are a dead end, when it comes to reusability. Yet, users prefer to work with visualizations as evidenced by WYSIWYG editors. To enable users to work with their data in a way that is intuitive to them, we have created Vizgr. Vizgr.com offers basic visualization methods, like graphs, tag clouds, maps and time lines. But unlike normal data visualizations, these can be re-used, connected to each other and to web sites. We offer a simple opportunity to combine diverse data structures, such as geo-locations and networks, with each other by a mouse click. In an evaluation, we found that over 85 % of the participants were able to use and understand this technology without any training or explicit instructions.
Vizgr: Linking Data in Visualizations.
In:
J. Cordeiro and J. Filipe, editors,
WEBIST 2011 Selected and Revised Papers.
Springer, 2011.
Daniel Hienert, Benjamin Zapilko, Philipp Schaer and Brigitte Mathiak.
[BibTeX]
Web-Based Multi-View Visualizations for Aggregated Statistics.
In: A. Bozzon, S. Comai and M. Norrie, editors,
Proceeding of the 2nd International Workshop on DATA Visualization and Integration on data-centric Web Services.
IEEE, 2011.
Daniel Hienert, Benjamin Zapilko, Philipp Schaer and Brigitte Mathiak.
[doi] [pdf]
[BibTeX]
A Science Model Driven Retrieval Prototype.
In: F. Boteram, W. Gödert and J. Hubrich, editors,
Proceedings of the Cologne Conference on Interoperability and Semantics in Knowledge Organization, volume 1, series Reihe Informations- und Bibliothekswissenschaften, pages 111-122.
Ergon Verlag, Würzburg, 2011.
Philipp Mayr, Philipp Schaer and Peter Mutschke.
[doi] [pdf]
[abstract]
[BibTeX]
This paper is about a better understanding of the structure and dynamics of
science and the usage of these insights for compensating the typical problems that
arises in metadata-driven Digital Libraries. Three science model driven retrieval services
are presented: co-word analysis based query expansion, re-ranking via Bradfordizing
and author centrality. The services are evaluated with relevance assessments
from which two important implications emerge: (1) precision values of the retrieval
service are the same or better than the tf-idf retrieval baseline and (2) each service retrieved
a disjoint set of documents. The different services each favor quite other – but
still relevant – documents than pure term-frequency based rankings. The proposed
models and derived retrieval services therefore open up new viewpoints on the scientific
knowledge space and provide an alternative framework to structure scholarly information
systems.
Applying Science Models for Search.
In: J. Griesbaum, T. Mandl and C. Womser-Hacker, editors,
Information und Wissen: global, sozial und frei? - Proceedings des 12. Internationalen Symposiums für Informationswissenschaft (ISI 2011), volume 58, series Schriften zur Informationswissenschaft, pages 184-196.
Verlag Werner Hülsbusch, Boizenburg, 2011.
Philipp Mayr, Peter Mutschke, Vivien Petras, Philipp Schaer and York Sure.
[doi] [pdf]
[BibTeX]
Mehrwertdienste für das Information Retrieval: das Projekt IRM.
In:
Wissen - Wissenschaft - Organisation, volume 12, series Fortschritte in der Wissensorganisation.
Ergon-Verlag, Würzburg, 2011.
(erscheint)
Philipp Mayr, Peter Mutschke, Philipp Schaer and York Sure.
[doi] [pdf]
[BibTeX]
Science models as value-added services for scholarly information systems..
Scientometrics, 89(1):349-364, 2011.
Peter Mutschke, Philipp Mayr, Philipp Schaer and York Sure.
[doi] [pdf]
[BibTeX]
Using Lotkaian Informetrics for Ranking in Digital Libraries.
In: C. Hoare and A. O'Riordan, editors,
Proceedings of the ASIS&T European Workshop 2011 (AEW 2011).
ASIS&T, Cork, Ireland, 2011.
Philipp Schaer.
[pdf]
[BibTeX]
Demonstrating a Service-Enhanced Retrieval System.
In: A. Grove, editor,
ASIST 2010 - Proceedings of the 73rd ASIS&T Annual Meeting, volume 47.
Pittsburgh, PA, USA, 2010.
Philipp Schaer, Philipp Mayr and Peter Mutschke.
[doi] [pdf]
[BibTeX]
Implications of Inter-Rater Agreement on a Student Information Retrieval Evaluation.
In: M. Atzmüller, D. Benz, A. Hotho and G. Stumme, editors,
Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen + Adaptivität.
Kassel, Germany, 2010.
Philipp Schaer, Philipp Mayr and Peter Mutschke.
[doi] [pdf]
[abstract]
[BibTeX]
This paper is about an information retrieval evaluation on three different retrieval-supporting services. All three services were designed to compensate typical problems that arise in metadata-driven Digital Libraries, which are not ade- quately handled by a simple tf-idf based retrieval. The services are: (1) a co-word analysis based query expansion mechanism and re-ranking via (2) Bradfordizing and (3) author centrality. The services are evaluated with relevance assessments conducted by 73 information science students. Since the students are neither information professionals nor domain experts the question of inter-rater agreement is taken into consideration. Two important implications emerge: (1) the inter-rater agreement rates were mainly fair to moderate and (2) after a data-cleaning step which erased the assessments with poor agreement rates the evaluation data shows that the three retrieval services returned disjoint but still relevant result sets.
Integration von Open Access Repositorien in Fachportale.
In:
Wissensspeicherung in digitalen Räumen. Nachhaltigkeit, Verfügbarkeit, semantische Interoperabilität, series Fortschritte in der Wissensorganisation, pages 245-251.
Ergon-Verlag, Würzburg, 2010.
Philipp Schaer.
[pdf]
[abstract]
[BibTeX]
Open Access Repositorien sind Online-Archive für frei im Internet zugängliche Publikationen im Volltext. Open Access Materialien oder die Open Access Repositorien selbst sind allerdings nur unzureichend in zentrale Fachportale (z.B. virtuelle Fachbibliotheken) eingebunden. Der Beitrag stellt SSOAR – Social Science Open Access Repository, einen disziplinären Open Access Volltextserver für die Sozialwissenschaften vor und zeigt wie dieser in das sozialwissenschaftliche Fachportal Sowiport integriert wird.
Aktivitäten von GESIS im Kontext von Open Data und Zugang zu sozialwissenschaftlichen Forschungsergebnissen.
In:
1. DGI-Konferenz, 62. DGI Jahrestagung - Semantic Web & Linked Data Elemente zukünftiger Informationsinfrastrukturen.
2010.
Anja Wilde, Agnieszka Wenninger, Oliver Hopt, Philipp Schaer and Benjamin Zapilko.
[pdf]
[abstract]
[BibTeX]
GESIS – Leibniz-Institut für Sozialwissenschaften betreibt mit dem Volltextserver SSOAR und der Registrierungsagentur für sozialwissenschaftliche Forschungsdaten da|ra zwei Plattformen zum Nachweis von wissenschaftlichen Ergebnissen in Form von Publikationen und Primärdaten. Beide Systeme setzen auf einen konsequenten Einsatz von Persistenten Identifikatoren (URN und DOI), was die Verknüpfung der durch da|ra registrierten Daten mit den Volltextdokumenten aus SSOAR sowie anderen Informationen aus den GESIS-Beständen ermöglicht. Zusätzlich wird durch den Einsatz von semantischen Technologien wie SKOS und RDF eine Verbindung zum Semantic Web hergestellt.
202 encyclopedia entries on information and computer science.
In:
K. Umlauf and S. Gradmann, editors,
Lexikon der Bibliotheks- und Informationswissenschaft. LBI.
Hiersemann, Stuttgart, 2009.
Philipp Schaer.
[BibTeX]
Enhancing Visibility: Integrating Grey Literature in the SOWIPORT Information Cycle.
In:
Ninth International Conference on Grey Literature: Grey Foundations in Information Landscape, series GL-conference series.
2008.
Maximilian Stempfhuber, Philipp Schaer and Wei Shen.
[pdf]
[BibTeX]
State-of-the-Art: Interaktion in Erweiterten Realitäten.
Research Report, Institut für Computervisualistik, Universität Koblenz-Landau, 2007.
Philipp Schaer and Marco Thum.
[pdf]
[BibTeX]
Visualisierung und Interaktion in Erweiterten Realitäten.
2007.
Philipp Schaer and Marco Thum.
[pdf]
[BibTeX]
Methodes and Services for Semantic Integration in Information Systems.
In:
German E-Science Conference.
Baden-Baden, 2007.
Maximilian Stempfhuber and Philipp Schaer.
[doi] [pdf]
[BibTeX]
Abstrakte Interaktionskonzepte in Erweiterten Realitäten.
Master's thesis (Diplomarbeit), Universität Koblenz-Landau, Campus Koblenz, Fachbereich 4 Informatik, Institut für Computervisualistik, 2006.
Philipp Schaer.
[pdf]
[BibTeX]
Grundlagen der Kognition und Perzeption für die Software-Ergonomie.
Research Report, Universität Koblenz-Landau, Institut für Computervisualistik, 2006.
Philipp Schaer and Holger Heuser.
[pdf]
[abstract]
[BibTeX]
Der folgende Arbeitsbericht soll eine kurze Zusammenfassung über die perzeptorischen und kognitiven Fähigkeiten des Menschen geben. Diese Zusammenfassung ist weit davon entfernt,umfassend zu sein. Jedoch bietet sie die Möglichkeit für Informatiker und Computervisualisten,einen kurzen Einblick in kognitionspsychologische Modelle zu gewinnen.Nacheinander sollen die Wahrnehmung des Menschen (Kapitel 2), die Funktionsweise desGedächtnisses (Kapitel 3), Kognition im Allgemeinen (Kapitel 4), die Theorien der mentalenModellkonstruktion, der dualen Kodierungstheorie (Kapitel 5), die Konstruktion mentalerModelle (Kapitel 6), die Wirkungsweise und Theorie der Metapher (Kapitel 7) und letztlich dieemotiven Faktoren (Kapitel 8) betrachtet werden.Jedes Kapitel besitzt eine kleine Zusammenfassung, die die Konsequenzen der jeweiligenErkenntnisse für die Entwicklung von Softwaresystemen erklärt.
Evaluation von Glanzlichtdetektionsverfahren in der Endoskopie.
Master's thesis (Studienarbeit), Universität Koblenz-Landau, Campus Koblenz, Fachbereich 4 Informatik, Institut für Computervisualistik, 2004.
Philipp Schaer.
[pdf]
[BibTeX]