Depends on who you ask. I am glad that the observability sector has standardized...

neonsunset · 2024-01-12T18:20:52 1705083652

C# has pretty nice integration with OTEL out of box (ASP.NET Core and otherwise, distributed as separate packages)

https://learn.microsoft.com/en-us/dotnet/core/diagnostics/ob...

yodon · 2024-01-13T00:42:31 1705106551

Coincidentally used in for real for the first time today. Saved me an insane amount of time in how easy it made finding the root case of a perf issue.

politelemon · 2024-01-12T18:13:18 1705083198

> my god are the reference implementations lacking.

Can you share some of your experience, what do you mean by that? Are there edge cases causing problems, or major missing features? Easy or difficult to use?

abeppu · 2024-01-12T19:48:07 1705088887

As an example, Exemplars are part of the metrics spec [1]. The official python library says metrics status is 'stable' [2]. But there's an approximately 2-year old issue with no work on it, titled 'Metrics: Add support for exemplars', where the latest update is that no work has begun [3]. Nothing at a top-level of the opentelemetry-python project indicates that the project does not implement everything in the metrics spec, so if you wanted to use that capability, you are apt to discover it relatively late.

[1] https://opentelemetry.io/docs/specs/otel/metrics/data-model/...

[2] https://github.com/open-telemetry/opentelemetry-python

[3] https://github.com/open-telemetry/opentelemetry-python/issue...

sph · 2024-01-13T07:01:30 1705129290

In the Elixir library, we don't even have metrics. I went into the OTel rabbit hole for two days trying to understand how is it better to Prometheus, just to learn it doesn't even do the basic thing, just traces.

I've mentally decided to just go Prometheus and ignore OpenTelemetry for the foreseeable future.

It's one of those things big players are hyping to preemptively lock you in their solution, but it's actually just alpha-quality new tech and "boring" "old" tech like Prometheus or statsd are simply more functional and better supported in the wild.

notpublic · 2024-01-13T16:12:16 1705162336

Elixir opentelemetry works quite well with Tempo. Tempo does the metric generation [1] and writes it to Prometheus. Tempo also does Service Graphs which works great with context propagation [2].

Btw, metric generation is not enabled in Tempo by default.

    # tempo.yml
    overrides:
      defaults:
        metrics_generator:
          processors: [service-graphs, span-metrics]
    
    # Prometheus
    --web.enable-remote-write-receiver
    
    # Grafana.yml
    [feature_toggles]
    enable = tempoSearch tempoBackendSearch traceToMetrics

[1] https://grafana.com/docs/tempo/latest/metrics-generator/ [2] https://hexdocs.pm/opentelemetry_process_propagator/Opentele...

filmor · 2024-01-13T07:50:59 1705132259

Metrics are implemented in the `opentelemetry_experimental` application. Last time I tried them, they were still a bit buggy but working (not complete, thiugh).

baby_souffle · 2024-01-13T04:51:40 1705121500

> Can you share some of your experience, what do you mean by that? Are there edge cases causing problems, or major missing features? Easy or difficult to use?

Just the general problem you get with big, slow moving OSS projects like this. Mostly just docs not current and a massive delta between certain languages; a feature is `stable` for some languages but not others which makes it hard to push for consistent otel roll out in a mixed-language environment.

Some other "misc" points:

- Google how to do $thing and you might find the proposed spec which gives example code ... that isn't what actually got implemented. That's a different link further down on your google results.

- Python auto-instrumentation is ... fragile at best. It's not super clear if instrumentation is supported only with well known frameworks or just ... in general. I'd sure love some docs that explain how it works, too.

- certain things require the collector use GRPC, others work with grpc or http... and I only found this out after googling an obscure error and reading through a _very_ long GH issue thread.

Thaxll · 2024-01-13T01:45:04 1705110304

I remember looking at the Go implementation, it did not look like Go code but it looked like someone was doing Java/C#.

arwineap · 2024-01-12T20:39:28 1705091968

otel logging is completely missing from golang for example

bbkane · 2024-01-12T22:42:31 1705099351

I agree this should be there, but I also think in most cases, logs can be completely replaced by otel tracing - see https://www.infoq.com/presentations/event-tracing-monitoring...

arwineap · 2024-01-12T23:19:12 1705101552

The example presented seems to also log, they just annotated the logs with span data

> What we ended up implementing was a little tee inside the o11y library. As well as sending events to Honeycomb, we also converted them to JSON, and wrote them to stdout. That way, after sending to stdout, we then pumped off to our standard log aggregation system. This way, we've got a fallback. If Honeycomb is not working, we can just see our logs normally. We could also send these off to S3 or some other long term storage system if we wanted.

I'd like to go a step further, and say that in addition to being worried about honeycomb being down, sometimes you just want to check with kubectl to get an idea what is going on.

Our current projects are very log light because of the heavy tracing instrumentation, but it'd be nice to integrate this with the otel paradigms as they were originally intended

eep_social · 2024-01-13T01:36:32 1705109792

trace spans are structured logs, they just also happen to correlate into a tree, ymmv

hinkley · 2024-01-13T18:11:46 1705169506

The silent failures by default.. I hate it.

So. Much.

Flames. Flames! On the sides of my face.

Breath… Heaving breaths.

pranay01 · 2024-01-12T18:15:46 1705083346

Agree, there being an open standard for instrumentation is a big win. Lots of work still needs to be done on showing more examples and making it more accessible to users & implementors.

One other key area is resources which can help get engineers/implementors to get organizational buy-in

hinkley · 2024-01-13T18:08:36 1705169316

Part of it is the spec. Stop letting Java people “help”. With friends like these who needs enemies. - ex Java person

cowgoesmoo · 2024-01-12T23:06:27 1705100787

Meh, I work in metrics observability and there's very little support for otel. Most new open source products are still based on Prometheus, which has much better SDK support than otel.

I think it's a mistake for Otel to do its own thing instead of just building on top of Prometheus.

MuffinFlavored · 2024-01-12T23:12:32 1705101152

Where it gets confusing:

https://grafana.com/docs/grafana-cloud/send-data/otlp/send-d...

hinkley · 2024-01-13T18:13:46 1705169626

I don’t agree with the communication patterns of either Prometheus or OpenTelemetry, but I’ll pick Prometheus next time I have to do telemetry. Unless there’s some fork of StatsD with tags that makes a resurgence.

wdb · 2024-01-13T12:29:44 1705148984

Prometheus supports the OTLP format

hinkley · 2024-01-13T18:15:48 1705169748

But OTLP is still hot garbage right now. If you send otlp to Prometheus it might get there, or it might all end up being dropped by a parsing error, because otelcollector is dumber than a box of hammers that have been through a rock tumbler.