Enterprise search is a challenging problem for most organizations. Public search engines such as Google index content and can rank it by link popularity in addition to basic keyword matches. Enterprise search is different: it sometimes requires specially designed indexes as well as several processing steps.
At the U.S. Patent & Trademark Office, part of the Department of Commerce, a team of professionals is building the next generation of search tools using open-source technologies. Like any large undertaking, it’s not a simple plug-and-play project.
Main topics covered in this talk:
+ Architectures for Large Scale Enterprise Search
+ Leveraging Apache Cassandra & Spark
+ Customizing / Configuring Apache Solr and Indexing
+ Writing a custom Parser for Solr in Scala
Event Sponsor: http://www.meetup.com/ACM-DC
ACM Washington DC is a non-profit professional group that meets regularly to discuss diverse topics in computer science such as predictive analytics, applied machine learning, statistical modeling, open data, data visualization, user experience, user research, and artificial neural networks. Meeting topics are varied and range from tutorials on basic concepts and their applications, to success stories from local practitioners and academic students, to discussions of tools, new technologies, and best practices. All are welcome to attend, to meet others, and to present their work.
ACM Washington DC is officially part of the ACM Local project of the Association for Computing Machinery, Inc.