You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
in Solr, you are able to retrieve it in query results even if it's not stored, docValues is used instead. This works in spark-solr, only not with multiValued=true fields.
SolrJ and regular solr api can provide such fields, but when we use them with spark-solr:
val s1 = Map(
"zkHost" -> "myZK",
"collection" -> "myCollection",
"query" -> "multivaluedField:[* TO *]",
"fields" -> "multivaluedField",
"max_rows" -> "100000",
"flatten_multivalued"-> "false"
)
val data = spark.read.format("solr").options(s1).load
data.createOrReplaceTempView("myTable")
Results with: data: org.apache.spark.sql.DataFrame = [id: string]
Notice that multiValuedField is not resolved.
This is a serious issue in my opinion, because it prohibits you from using streaming method when you need multiValued fields in an RDD.
The text was updated successfully, but these errors were encountered:
Using Solr 8.4.0, Spark-Solr 3.6.1 Spark: 2.11
When a field is configured with:
stored="false" docValues="true" useDocValuesAsStored="true"
in Solr, you are able to retrieve it in query results even if it's not stored, docValues is used instead. This works in spark-solr, only not with multiValued=true fields.
SolrJ and regular solr api can provide such fields, but when we use them with spark-solr:
Results with:
data: org.apache.spark.sql.DataFrame = [id: string]
Notice that multiValuedField is not resolved.
This is a serious issue in my opinion, because it prohibits you from using streaming method when you need multiValued fields in an RDD.
The text was updated successfully, but these errors were encountered: