TIL that the ascii_folding token filter in #elasticsearch handle a lot more than just ASCII 😳
I will continue to use icu_folding, anyway! 😜
Look at the code: github.com/apache/lucen...
#lucene #java #icu
#MissKitty #Bluesky AT #Protocol Babble
I have run four #prompts already, and I'm waiting for the answer from an add-on question. It's not as easy as simple #Google search to find things if you want to be exhaustive and not just lucky on Bluesky. I've never heard of #lucene modifiers. My head hurts.
📢 Apache Lucene 10.4.0 is out!
✅ Many lucene queries should see a performance improvement of 10-15%
✅ A new scalar quantized format for dense vectors and knn search.
#Lucene sits at the heart of so many #search platforms, including @opensearch.org, that stand to benefit from this release.
I've just released a new version of my #Java #Lucene #MCP server. Integrate #fulltext #search for local files directly into #Claude Desktop. search documents, perform drilldown-queries and much more. Bonus: includes a MCP #App for administrative tasks. Checkout github.com/mirkosertic/....
I've release a #Java #Apache #Lucene #MCP server for local #fulltext #search from within your #Claude Desktop session. The server supports a chat based interface to index local directories and query them in a #conversational way. Checkout github.com/mirkosertic/... for more!
🔴LIVE | [ แสร้งว่า ASMR🎧 ] ของขวัญปีใหม่นี้ รับพี่สาวสักคนไหมคะ💭 | LUCENE 🌜
#LUCENE #LuceneCh #VTuber
Uber Adopts Amazon OpenSearch for Semantic Search to Better Capture User Intent To improve search and recommendation user experiences, Uber migrated from Apache Lucene to Amazon OpenSearch to suppo...
#Lucene #Uber #Amazon #vector #databases #Apache #OpenSearch […]
[Original post on infoq.com]
Uber Adopts Amazon OpenSearch for Semantic Search to Better Capture User Intent To improve search and recommendation user experiences, Uber migrated from Apache Lucene to Amazon OpenSearch to suppo...
#Uber #Amazon #Lucene #Apache #vector #databases #OpenSearch […]
[Original post on infoq.com]
The upcoming #OpenSearchCon Europe will have a designated "Search & Apache Lucene" track.
If you're involved in Apache #Lucene project, or the broader search and relevancy ecosystem, I encourage you to submit a talk:
events.linuxfoundation.org/opensearchco...
@opensearch.org @apache.org #search
I found some time to update #JavaFX #DesktopSearch to the latest #Java 25, #Lucene and #Tika releases. Maybe I will also add some #LLM or #MCP features. We'll find out. Checkout github.com/mirkosertic/... for more to come :-)
🚀 Introducing Lucene-on-Faiss
⚡ 2x boost in search throughput
💡 Decrease memory limitations
📖 Blog: opensearch.org/blog/lucene-...
#OpenSearch #VectorSearch #Lucene #Faiss #AI #GenerativeAI #ANN #SearchTech
#Lucene just switched from a binary heap to a ternary heap to collect top hits by score. This helps a small bit when computing top-100 hits (~2% on the fastest queries) but up to 15% when computing top-1000 hits - thanks to better cache efficiency github.com/apache/lucen...
Lucenia 0.6.1 just dropped.
✅ Ingest attachments (PDF, Word, more)
☁️ Faster remote-backed indexing
💻 Ubuntu + Mac (Windows soon)
⚡ Lucene 10.2 boost
AI-retrieval — now faster than ever.
🔗 lucenia.io/2025/07/31/i...
#retrieval #devops #cio #ciso #search #ai #aitools #searchai #lucenia #lucene
OpenSearch 3.0 Now Generally Available, with a Focus on Vector Database Performance and Scalabili...
www.infoq.com/news/2025/05/opensearch-...
#Lucene #OpenSearch […]
[Original post on infoq.com]
OpenSearch 3.0 Now Generally Available, with a Focus on Vector Database Performance and Scalabili...
www.infoq.com/news/2025/05/opensearch-...
#AWS #OpenSearch […]
[Original post on infoq.com]
ElasticSearch — Inverted Index, Source, Index, Norms, Routing… - Lucene vs ElasticSearch, Details about Inverted Index, source-index-norms-routing Explained in Detail #elasticsearch #lucene #invertedindex #routing #indexing senoritadeveloper.medium.com/elasticsearc...
Indexing in Zimbra - How E-Mail Content is Indexed in Zimbra #zimbra #zimbracollaboration #indexing #lucene #elasticsearch senoritadeveloper.medium.com/indexing-in-...
I'm glad to share that Luca Cavanna and I will be speaking about the #Lucene 10.0 release at Berlin Buzzwords in June.
It's time to redo benchmarks! #Lucene 10.2 was just released, with
- huge speedups to non-scoring boolean queries, range queries and filtered vector search,
- better merging defaults for faster search,
- much faster merging of vectors
And more...
lucene.apache.org/core/corenew...
Indexing and merging times are getting better for #Apache #Lucene vector search. Lucene has a read-only segment architecture. One of the drawbacks of this approach is throwing away previously completed work when merging HNSW graphs. Well, this got better :)
Something that I wish was better known: if you do not configure an index sort on your #Lucene indexes, you are missing search-time efficiency benefits that are almost certainly worth the (low) index-time overhead.
omg we've gone full circle and Information Retrieval is back to boolean operators. #lucene #solr #sql #nltk #askjeeves
meetup
talk
nvidia GTC is coming to the bay area next week. we'll be there with a
* talk about bringing #lucene to the GPU
* a "guess that prompt" meetup between galileo + UnstructuredIO + elastic. join us to outsmart AI ;)
lu.ma/guess-that-p...
talk
meetup
nvidia GTC is coming to the bay area next week. we'll be there with a
* talk about bringing #lucene to the GPU
* a "guess that prompt" meetup between galileo + UnstructuredIO + elastic. join us to outsmart AI ;)
https://lu.ma/guess-that-prompt
Apache Solr vs OpenSearch - Comparison and Key Differences OpenSearch and Apache Solr are both bu...
bigdataboutique.com/blog/apache-solr-vs-open...
#OpenSearch #Apache #Solr #Apache #Lucene
Event Attributes