Expert Details
Forensic Linguistics, Authorship Analysis, Data Science, Text Analytics, Machine Learning, Artificial Intelligence
ID: 727236
Illinois, USA
Expert's published research includes several influential results in this area. His work on determining author gender from writing style was the first to show clear differences between male and female writing in both formal and informal genres, and across multiple languages and time-periods. He has shown how certain aspects of personality are reliably indicated by certain patterns of word usage, obtaining more accurate results than competing approaches. A primary contribution of his research on stylistic text analysis, is a novel methodology for constructing meaningful features for authorship analysis (based on functional/semantic principles) which give better results than previous methods in many cases.
His research has also been instrumental in working to solidify the methodological bases of the fields of automated authorship attribution and computational linguistics, which heretofore were plagued by methodological inconsistencies and low standards of reproducibility. He was the first to give a clear theoretical explanation of the popular "Delta" method for authorship attribution, thus enabling better analysis of its reliability in different circumstances. He was the principal investigator of an NSF-funded project to solidify the methodological bases of the field of automated authorship attribution, and is currently co-PI of the NSF-funded followup effort to create a standardized corpus for research in the area.
Expert has over 20 years of research experience in diverse areas of artificial intelligence and machine learning. His published research includes work on active learning, ensemble learning, learning with prior knowledge, robotic navigation, visual place recognition, shallow parsing, sentiment analysis, computational stylistics, and authorship attribution.
Expert has published numerous papers in many areas of natural language processing and computational linguistics, including shallow parsing, sentiment analysis, computational stylistics, genre analysis, text classification, part-of-speech tagging, information extraction, information retrieval, morphological analysis and rhetorical analysis. He has consulted to industry on projects in legal document processing, information retrieval, and business intelligence.
Expert consulted for AOL Research (2006), providing new algorithms and technology for development of a prototype system to predict emerging trends in web search. The system could predict in advance many of the most popular web searches for the following day, as well as give meaningful explanations for why people suddenly became interested in them.
Expert consulted for Radcom, Ltd. (1998-99) in Tel-Aviv, Israel, helping develop novel proprietary technology more more efficient analysis of Internet protocol traffic.
From 1995 to 1998, Expert had a long-term consulting arrangement with Expert-Ease Development, Inc., now part of Thomson Reuters. His primary contribution was the development of the patented text mining algorithms that form the core of the company's flagship product, Deal Proof (available from Thomson West). He also helped with the design and development of many other aspects of that product. Expert consulted for Compassware Development, Inc. (1995-1997), an information retrieval software company in Jerusalem, Israel. He developed new methods for effective information retrieval, document clustering, and text mining, as part of the company's InfoMagnet enterprise content management system.
Research interests:
Natural language processing and computational linguistics, particularly metaphor
analysis, systemic functional linguistics, shallow parsing, and discourse structure.
Computational stylistics, including text categorization by writing style, gender issues in
linguistic expression, forensic linguistics, and sociolinguistics, with applications to
humanities scholarship, cybersecurity, and counterterrorism.
Data mining and machine learning from very large textual corpora, including intelligent
text retrieval, categorization, summarization, document search, and pattern
recognition.
Scientic methodology, epistemology, and ethics; the relationship between the logical,
statistical, cognitive, and social structures in science and reproducibility and
reliability of scientic research and societal implications.
Digital humanities, particularly application of machine learning and natural language
processing to literary textual analysis and visualization.
Expert has been consulting since 1995.
Forensic linguistics:
2018: Corporate civil arbitration authorship analysis case. Case ongoing.
2018: Public Defender Service for District of Columbia.
Murder case, involving validation of authorship analysis of short online texts. Provided evaluation and validation of text analytic methodologies.
2014: Patent infringement case dealing with network security technology, specically
scanning for byte signatures in a data stream. Case dismissed before nal report
completed.
2012: Evaluation of whether a company's e-discovery procedures comply with discovery
requirements. Case settled before nal report completed.
2011: Analysis of the authorship of text messages and emails as evidence of conspiracy
and murder. Exploratory analysis performed, but no reports or testimony required.
2010: Authorship analysis of text messages containing death threats. Preliminary report
submitted; full report and testimony not required.
Education
Year | Degree | Subject | Institution |
---|---|---|---|
Year: 1994 | Degree: Ph.D. | Subject: Computer Science | Institution: Yale University |
Year: 1991 | Degree: MPhil | Subject: Computer Science | Institution: Yale University |
Year: 1988 | Degree: BS | Subject: Applied Mathematics | Institution: Carnegie-Mellon University |
Work History
Years | Employer | Title | Department |
---|---|---|---|
Years: 2002 to Present | Employer: Undisclosed | Title: Associate Professor | Department: Computer Science |
Responsibilities:Available upon request. |
|||
Years | Employer | Title | Department |
Years: 1999 to 2002 | Employer: Jerusalem College of Technology | Title: Assistant Professor | Department: Computer Science |
Responsibilities:Available upon request. |
|||
Years | Employer | Title | Department |
Years: 1996 to 1999 | Employer: Bar-Ilan University | Title: Research Scientist | Department: Mathematics and Computer Science |
Responsibilities:Available upon request. |
|||
Years | Employer | Title | Department |
Years: 1994 to 1996 | Employer: Bar-Ilan University | Title: Fulbright Postdoctoral Research Fellow | Department: Mathematics and Computer Scientists |
Responsibilities:Available upon request. |
|||
Years | Employer | Title | Department |
Years: 1988 to 1994 | Employer: Yale University | Title: Graduate Research Assistant | Department: Computer Science |
Responsibilities:Available upon request. |
|||
Years | Employer | Title | Department |
Years: 1985 to 1988 | Employer: Carnegie Mellon University | Title: Research Programmer | Department: Computer Science |
Responsibilities:Available upon request. |
International Experience
Years | Country / Region | Summary |
---|---|---|
Years: 1994 to 2002 | Country / Region: Israel | Summary: Expert lived and worked primarily in Israel from 1994 to 2002, in academic research and teaching positions as well as for a variety of consulting clients. |
Career Accomplishments
Associations / Societies |
---|
Fellow, British Computer Society (BCS); Senior Member, IEEE; Member, Association for Computing Machinery (ACM); Association for the Advancement of Articial Intelligence (AAAI); Association for Computational Linguistics (ACL); International Association of Forensic Linguists (IAFL), American Association for the Advancement of Science (AAAS). Corporate Advisory Boards: Knack (San Francisco; Israel); Sentisis (Madrid); Visual Process (Israel); AskWhai (Chicago). |
Professional Appointments |
---|
Research Funding Department of Energy: Deep-learning approaches for the analysis of synchrotron data of materials used in energy and environmental applications, Idaho National Laboratory subcontract IBM: Empirical Analysis of the Data Science Job Market Intelligence Community Postdoc: Computational Cognitive Stylistics for Multi-Modal Identity Analytics IARPA: Scalable Cross-Lingual Metaphor Identication and Conceptual Analysis NIH/NLM SBIR: Automated matching of relevant research studies to patient records for Evidence Based Medicine, Parity Computing, Inc. subcontract NSF CISE/CRI: Community Resources for Authorship Attribution Research (joint with Duquesne U.) NSF CISE/CRI: Community Resources for Authorship Attribution Research: REU Supplement Binational Science Foundation (BSF): Collaborative Proposal: Automated Text Analytic Methods to Assess Psychological Attributes of Authors (joint with Bar-Ilan University and University of Texas at Austin) ARDA Challenge Workshop: Complex Document Information Processing (CDIP) NSF CISE/CRI: Community Resources for Research in Automated Authorship Attribution (joint with Rutgers U.) IBM/EAS KDD Challenge: Meta-Learning and Computational Stylistics for Authorship Entity Recognition NSF CISE/EI: An Undergraduate CS Specialization in Knowledge Management Systems NSF CISE/EI: An Undergraduate CS Specialization in Knowledge Management Systems: REU Supplement Conference Organizing Cofounder: Chicago Colloquium on Digital Humanities and Computer Science; Program Chair, 2006; Organizational Cochair, 2007; General Chair 2009. Program Cochair: Big Data Computing and Communications, Taiyuan, China, 2015. Program Cochair: Biennial Israeli Symposium on Foundations of Articial Intelligence (BISFAI), 2007. Cochair, SIGIR 2006 Workshop | Stylistics for Text Retrieval in Practice, Seattle, WA, August 2006. Workshop Chair: ACM Conference on Information and Knowledge Management, 2004 and 2005. Cochair, SIGIR 2005 Workshop | Textual Stylistics in Information Access, Salvador, Brazil, August 2005. Cochair, AAAI 2004 Fall Symposium | Style and Meaning in Language, Art, Music, and Design, Washington, DC, October 22-24, 2004. Special session organizer: \Intelligent Text Processing" at the Eighth International Symposium on Articial Intelligence and Mathematics, January 2004. Chair, IJCAI-03 Workshop | Doing it With Style: Computational Approaches to Style Analysis and Synthesis, Acapulco, MX, August 10, 2003. Program Committees: NAACL/HLT: 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018 BigCom 2018: Big Data Computing and Communications, Chicago, IL, 2018. Twenty-Seventh AAAI Conference on Articial Intelligence, 2013. First Workshop on Metaphor in NLP, at NAACL 2013. International Workshop on Plagiarism Analysis, Authorship Identication, and Near-Duplicate Detection, at CLEF 2011, 2012, and 2013. EMNLP-10: Empirical Methods in Natural Language Processing, 2010. First International Workshop on Opinion Mining for Business Intelligence, 2010. SEPLN´09 Workshop PAN. Uncovering Plagiarism, Authorship and Social Software Misuse, 2009. Conference of the Association for Computational Linguistics (ACL), 2007. Association for Articial Intelligence, 2007. ACM SIGIR Conference on Research & Development on Information Retrieval, Senior Program Committee, 2008. ACM SIGIR Conference on Research & Development on Information Retrieval, 2006, 2007. Chicago Colloquium on Digital Humanities and Computer Science, 2006{2009. International Workshop on Plagiarism Analysis, Authorship Identication, and Near-Duplicate Detection, at ACM SIGIR 2007. 18th International Conference on Pattern Recognition (ICPR), 2006. International Conference on Human Language Technology { Empirical Methods in Natural Language Processing, 2005. International Joint Conference on Articial Intelligence, Senior Program Committee (poster track), Edinburgh, Scotland, July/August 2005. International Computational Systemic Functional Linguistics Conference, Sydney, Australia, July 2005. ACM Conference on Information and Knowledge Management, 2004, 2005. Biennial Israeli Symposium on Foundations of Articial Intelligence, Ramat Gan, Israel; 1999, 2005. |
Awards / Recognition |
---|
Fellowships & Awards Fellow of the British Computer Society, elected Distinguished Lecturer in Forensic Linguistics, Aston University Senior Fellow, Brain Sciences Foundation (Providence, RI), 2011{present. Senior Fellow, Center for Advanced Defense Studies (Washington, DC). Israel Science Foundation New Immigrant Scientist Fellow (Bar-Ilan University, Israel) Computational Design Fellow, Center for Computational Design, Rutgers Fulbright Postdoctoral Fellow (Bar-Ilan University, Israel), CIES/USIEF Fannie and John Hertz Foundation Graduate Fellow (Yale University) |
Publications and Patents Summary |
---|
He has 79 peer-reviewed publications and many others, and one patent. He is the editor of two current books: Computational Methods for Counterterrorism, and The Structure of Style. Publication statisticsy: Total citations: 8304 h-index: 38 Citations since 2014: 4019 Journals Founding Co-Chief Editor: Frontiers in Artificial Intelligence: Language and Computation, Frontiers. Associate editor: Register Studies, John Benjamin. Guest editor: Special topic section of Journal of the American Society for Information Sciences and Technology on Computational Analysis of Style Reviewing: Communications of the ACM, Computational Intelligence, Digital Humanities Quarterly, Foundations and Trends in Information Retrieval, IEEE Transactions on Internet Systems, IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Transactions on Robotics and Automation,, Journal of Articial Intelligence Research, Journal of Computer-Mediated Communication, Journal of Machine Learning Research, Journal of the American Society for Information Sciences and Technology, Literary and Linguistic Computing, Machine Learning. |
Additional Experience
Training / Seminars |
---|
Academic Experience Employer, Chicago, IL. 2019{present: Chair (interim), Department of Computer Science 2013{present: Professor (tenured), Department of Computer Science 2013{present: Director and founder, Master of Data Science Developed/directed new interdisciplinary degree program; grew program to over 60 students in four years; developed diverse industrial partnerships. 2002{2013: Associate Professor (tenured), Department of Computer Science Jerusalem College of Technology, Jerusalem, Israel. 1999{2002: Senior Assistant Professor, Department of Computer Science Bar-Ilan University, Ramat Gan, Israel. 1999{2002: Research Aliate, Department of Mathematics and Computer Science 1996{1999: Research Scientist, Department of Mathematics and Computer Science 1994{1996: Fulbright Fellow, Department of Mathematics and Computer Science Rutgers University, Piscataway, NJ. Summer 1998: Visiting Research Scientist, Center for Computational Design. University of Chicago, Chicago, IL. Summer 1994: Postdoctoral Fellow, Department of Computer Science |
Language Skills
Language | Proficiency |
---|---|
Hebrew | Expert speaks and reads Hebrew fluently as a non-native, and writes Hebrew with good facility. |
Arabic | Expert is familiar with the principles of Arabic grammar and orthography, and has published research on computational analysis of the language. |
Fields of Expertise
plagiarism, forensics, forensic document examination, artificial intelligence, learning machine, semantic analysis, natural language text processing, computational linguistics, machine learning, terrorism, counter-terrorism, intelligence (information gathering), ghostwriting, academic fraud, biometric identification, competitive intelligence, parser software, computational method, linguistics, text retrieval, state space (artificial intelligence), document, information retrieval, research and development, artificial-intelligence programming language, formal language, language, science, artificial neural network, pattern recognition, computer science